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INTERFERON-ALPHA POLYPEPTIDES AND CONJUGATES 



CROSS-REFERENCE TO RELATED APPLICATIONS 

Pursuant to 35 U.S.C. § 119(e), this application claims the benefit of U.S. 
Provisional Application Serial No. 60/572,504 filed on May 19, 2004, the disclosure of 
which is incorporated by reference herein in its entirety for all purposes. 

FIELD OF THE INVENTION 

The present invention relates generally to polynucleotides and polypeptides encoded 
therefrom, conjugates of the polypeptides, as well as vectors, cells, antibodies, and 
methods for using and producing the polynucleotides, polypeptides, and conjugates. 

BACKGROUND OF THE INVENTION 

Interferon-alphas are members of the diverse helical-bundle superfamily of cytokine 
genes (Sprang, S.R. et al. (1993) Curr. Opin. Struct. Biol. 3:815-827). The human 
interferon-alphas are encoded by a family of over 20 tandemly duplicated nonallelic 
genes and psiiedogenes that share 85-98% sequence identity at the amino acid level 
(Henco, K. et al. (1985) J. Mol. Biol. 185:227-260). Genes which express active 
interferon-alpha proteins have been grouped into 13 families according to genetic loci. 
Known expressed human interferon-alpha proteins and their allelic variations are 
tabulated in Allen G. and Diaz M.O. (1996) J. Interferon and Cytokine Res. 16:181-184. 

Interferon-alphas have been shown to inhibit various types of cellular proliferation, 
and are especially useful for the treatment of a variety of cellular proliferation disorders 
frequently associated with cancer, particularly hematologic malignancies such as 
leukemias. These proteins have shown antiproliferative activity against multiple 
myeloma, chronic lymphocytic leukemia, low-grade lymphoma, Kaposi's sarcoma, 
chronic myelogenous leukemia, renal-cell carcinoma, urinary bladder tumors and ovarian 
cancers (Bonnem, E.M. et al. (1984) J. Biol. Response Modifiers 3:580; Oldham, R.K. 
(1985) Hospital Practice 20:71). 
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Interferon-alphas are also useful against various types of viral infections (Finter, 
N.B. et al. (1991) Drugs 42(5):749). Interferon-alphas have activity against human 
papillomavirus infection, Hepatitis B, and Hepatitis C infections (Finter, N.B. et al., 1991, 
supra; Kashima, H. et al. (1988) Laryngoscope 98:334; Dusheiko, G.M. et al. (1986) J. 
5 Hematology 3 (Supple. 2):S199; Davis, GL et al. (1989) N. England J. Med. 321:1501). 
The role of interferons and interferon receptors in the pathogenesis of certain autoimmune 
and inflammatory diseases has also been investigated (Benoit, P. et al. (1993) J. Immunol. 
150(3):707). 

Although these proteins possess therapeutic value in the treatment of a number of 
10 diseases, they have not been optimized for use as pharmaceuticals. For example, dose- 
limiting toxicity, receptor cross-reactivity, and short serum half-lives significantly reduce 
the clinical utility of many of these cytokines (Dusheiko, G. (1997) Hepatology 26:1 12S- 
12 IS; Vial, T. and Descotes, J. (1994) Drug Experience 10: 1 15-150; Funke, L et al. 
(1994) Ann. Hematol. 68:49-52; Schomburg, A. et al. (1993) J. Cancer Res. Clin. Oncol. 
15 1 19:745-755). Diverse and severe side effect profiles which accompany interferon 

administration include flu-like symptoms, fatigue, hallucination, fever, hepatic enzyme 
elevation, and leukopenia (Pontzer, C.H. et al. (1991) Cancer Res. 51:5304; Oldham, 
1985, supra). 

Hepatitis-C virus (HCV) is a nonhost integrated RNA virus with a very high rate of 
20 replication and is therefore associated with a large degree of genetic diversity. At least 
six genotypes and more than thirty subtypes of HCV RNA have been identified. HCV 
genotype has been shown to be a predictor of response to IFN-alpha therapy. Patients 
infected with HCV genotypes 2 and 3 have been found to generally respond well to 
interferon therapy. Patients infected with genotypes 4, 5 and 6 tend to respond less well. 
25 Patients infected with HCV genotype 1 tend to respond very poorly to interferon therapy, 
with about 50% of Genotype 1 patients classified as "nonresponders" towards IFN-alpha 
therapy. Genotype 1 is currently the most prevalent form of Hepatitis C, infecting 
approximately 70% of patients in the US and 50% of patients in Europe. Clearly, there is 
a pressing need for more effective therapies for HCV infection, particularly of the 
30 Genotype 1 variety. 

There is genetic and biochemical evidence that Genotype 1 HCV (and other 
subtypes) actively attenuate the IFN-alpha signaling pathway by inhibiting key IFN 
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responsive proteins such as the dsRNA-activated serine/threonine protein kinase PKR 
(Katze M.G., et al (2002) Nat. Rev. Immunol. 2(9):675-687) . As a likely consequence 
of this genetic diversity and active inhibition of the antiviral response, HCV (particularly 
Genotype 1) has the ability to escape the host's immune surveillance, leading to a high 
5 rate of chronic infection. The extensive genetic heterogeneity of HCV has important 
diagnostic and clinical implications, potentially accounting for variations in clinical 
course, difficulties in vaccine development, and lack of response to therapy. 

The present invention addresses the need for interferon-alpha molecules which 
exhibit enhanced antiviral and/or immunomodulatory efficacy. The invention provides 
10 novel interferon-alpha polypeptides and polypeptide conjugates, nucleic acids encoding 
the polypeptides, and methods of using such molecules. Such molecules would be of 
beneficial use in a variety of applications, including, e.g., therapeutic and prophylactic 
treatments, particularly for viral infections and diseases and conditions associated with 
viral infections. The present invention fulfills these and other needs. 

15 SUMMARY OF THE INVENTION 

The present invention provides novel polypeptides, including variants thereof and 
fusion proteins comprising such polypeptides. The invention also provides conjugates 
comprising a polypeptide of the invention covalently linked to one or more non- 
polypeptide moieties. The invention also provides nucleic acids encoding any of the 

20 polypeptides of the invention, and vectors and host cells comprising such nucleic acids. In 
addition, the invention provides methods of making and using such polypeptides, 
conjugates, and nucleic acids, and other features apparent upon further review. 

In one aspect, the invention provides an isolated or recombinant polypeptide, the 
polypeptide comprising a sequence identified as one of SEQ ID NOs:2-35 and SEQ ED 

25 NOs: 37-44 (such as one of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID 
NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 10, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 

30 NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
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NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, and SEQ ID NO:44). The invention also provides fusion proteins 
and conjugates comprising any of these polypeptides, nucleic acids encoding these 
polypeptides, and methods of making and using these polypeptides. 

5 The invention also provides isolated or recombinant polypeptides which each 

comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO: 1 
and (b) comprises one or more substitutions relative to SEQ ID NO: 1 selected from the 
group consisting of F48A/L; V51P; F55A; F65A; F68P; LI 11 A; and V114P. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO;l in 1,2, 3, 4, 5, 6, 7, 

10 8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 

15 amino acid positions. In some instances, the polypeptide further comprises one or more 
substitutions relative to SEQ ID NO: 1 selected from the group consisting of H47Q; 
V51A; Q52P/E; A53T; F55S; L56V; F57L; Y58H; M61I; N113K; V114E; andE160D. 
In some instances, the polypeptide exhibits an interferon-alpha activity (such as, e.g., 
antiviral activity, ThI differentiation activity, and/or antiproliferative activity). The 

20 invention also provides fusion proteins and conjugates comprising any of these 

polypeptides, nucleic acids encoding such polypeptides, and methods of making and 
using such polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID 

25 NO: 13 and (b) comprises one or more of: Ala or Leu at position 48; Pro at position 51; 
Ala at position 55; Ala at position 65; Pro at position 68; Ala at position 111; and Pro at 
position 1 14 (position numbering relative to SEQ ID NO: 13). Some such polypeptides 
comprise a sequence which differs from SEQ ID NO: 13 in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 0-15 amino acid positions, in 0-14 

30 amino acid positions, in 0-13 amino acid positions, in 0-12 amino acid positions, in 0-1 1 
amino acid positions, in 0-10 amino acid positions, in 0-9 amino acid positions, in 0-8 
amino acid positions, in 0-7 amino acid positions, in 0-6 amino acid positions, in 0-5 
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amino acid positions, in 0-4 amino acid positions, in 0-3 amino acid positions, in 0-2 
amino acid positions, or in 0-1 amino acid positions. Some such polypeptides comprise 
an Ala or a Leu at position 48. Some such polypeptides comprise an Ala at position 48. 
In some instances, the polypeptide further comprises one or more substitution(s) relative 
5 to SEQ ID NO: 13 selected from the group consisting of H47Q; V5 1 A; Q52P/E; A53T; 
F55S; L56V; F57L; Y58H; M61I; Nl 13K; VI 14E; and E160D. In some instances, the 
polypeptide exhibits an interferon-alpha activity (e.g., antiviral activity, T H 1 
differentiation activity, and/or antiproliferative activity). Some such polypeptides further 
comprise one or more additional amino acid(s), such as a methionine added to the N- 

10 terminus. The invention also provides fusion proteins and conjugates comprising these 
polypeptides, and isolated or recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO: 36 and (b) comprises one or more substitutions relative to SEQ ID NO: 36 selected 

15 from the group consisting of M21 A; I24P; F48A/L; T5 IP; S55A; F65A; F68P; F90A; 
M93P; LI 1 1 A; VI 14P; F124A; I127P and E160D. Some such polypeptides comprise a 
sequence which differs from SEQ ID NO:36 in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 
15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 1-14 amino acid 
positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1-11 amino acid 

20 positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 amino acid 
positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 amino acid 
positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 amino acid 
positions. In some instances, the polypeptide further comprises one or more substitutions 
relative to SEQ ID NO:36 selected from the group consisting of P26L; H47Q; T51V; 

25 S55P/F; V56L; H58Y; L60M; F90Y; M93L; N95D; N113K; V114E; R125Q; T132K; 
and L154F. In some instances, the polypeptide exhibits an interferon-alpha activity 
(such as, e.g., antiviral activity, T H 1 differentiation activity, and/or antiproliferative 
activity). The invention also provides fusion proteins and conjugates comprising any of 
these polypeptides, nucleic acids encoding such polypeptides, and methods of making and 

30 using such polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID 

5 
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NO:38 and (b) comprises one or more of: Ala at position 21; Pro at position 24; Ala or 
Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; Pro at 
position 68; Ala at position 90; Pro at position 93; Ala at position 111; Pro at position 
114; Ala at position 124; Pro at position 127; and Glu at position 160 (position numbering 
5 relative to SEQ ID NO: 38). Some such polypeptides comprise a sequence which differs 
from SEQ ID NO:38 in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid 
positions, e.g., in 0-15 amino acid positions, in 0-14 amino acid positions, in 0-13 amino 
acid positions, in 0-12 amino acid positions, in 0-1 1 amino acid positions, in 0-10 amino 
acid positions, in 0-9 amino acid positions, in 0-8 amino acid positions, in 0-7 amino acid 

10 positions, in 0-6 amino acid positions, in 0-5 amino acid positions, in 0-4 amino acid 
positions, in 0-3 amino acid positions, in 0-2 amino acid positions, or in 0-1 amino acid 
positions. Some such polypeptides comprise an Ala or a Leu at position 48. Some such 
polypeptides comprise an Ala at position 48. In some instances, the polypeptide further 
comprises one or more substitution(s) relative to SEQ ID NO:38 selected from the group 

15 consisting of P26L; H47Q; V51T; F55P/S; L56V; Y58H; L60M; F90Y; M93L; N95D; 
Nl 13K; VI 14E; R125Q, T132K; and F154L. In some instances, the polypeptide 
exhibits an interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, 
and/or antiproliferative activity). Some such polypeptides further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus. The invention 

20 also provides fusion proteins and conjugates comprising these polypeptides, nucleic acids 
encoding such polypeptides, and methods of making and using such polypeptides. 

The invention also provides conjugates comprising a polypeptide of the invention, 
such as any of the polypeptides of the invention described above, and at least one non- 
polypeptide moiety attached to an attachment group of the polypeptide, wherein the 

25 conjugate exhibits an interferon-alpha activity. In some instances, the non-polypeptide 
moiety is a polymer (for example, a polyalkylene oxide molecule, such as a polyethylene 
glycol (PEG), such as a monomethoxypolyethylene glycol (mPEG)), or a sugar moiety. 
The at least one non-polypeptide moiety may, for example, be attached to a cysteine, to a 
lysine, to the N-terminal amino group of the polypeptide, or to an in vivo glycosylation 

30 site of the polypeptide. The invention also provides methods of making and using such 
conjugates. 

6 
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The invention also provides isolated or recombinant nucleic acids encoding any of 
the polypeptides of the invention. The invention also provides vectors and host cells 
comprising such nucleic acids, and methods of making polypeptides of the invention 
comprising culturing host cells comprising such nucleic acids. 
5 In another aspect, the invention provides a method of inhibiting viral replication in 

virus-infected cells, the method comprising contacting the virus-infected cells with a 
polypeptide or a conjugate of the invention. The invention also provides a method of 
reducing the number of copies of a virus in virus-infected cells, comprising contacting the 
virus-infected cells with a polypeptide or a conjugate of the invention. 

10 In another aspect, the invention provides a method of treating a condition which is 

responsive to interferon-alpha, comprising administering to a subject afflicted with the 
condition a composition comprising a polypeptide of the invention or a conjugate of the 
invention in an amount effective to ameliorate a symptom associated with the condition. 
Such conditions include Chronic Hepatitis C infection, Chronic Hepatitis B infection, 

15 Hairy Cell Leukemia, Malignant Melanoma, Follicular Lymphoma, Condylomata 
Acuminata, AIDS-related Kaposi's Sarcoma, Non-Hodgkin's Lymphoma, Chronic 
Melogenous Leukemia, Basal Cell Carcinoma, Multiple Myeloma, carcinoid tumors, 
bladder cancer, Crohn's Disease, Cutaneous T Cell Lymphoma, Renal Cell Carcinoma, 
Multiple Sclerosis, and AIDS. The invention also includes the use of a composition 

20 comprising a polypeptide of the invention or a conjugate of the invention to treat a 
condition which is responsive to interferon-alpha, such as for example a condition 
described above. 

These and other objects and features of the invention will become more fully 
apparent when the following detailed description is read in conjunction with the 
25 accompanying figures. 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A and IB show biphasic timecourses for viral clearance from HCV- 
infected cells following IFN-alpha treatment (A. Nonresponder kinetics; B. Responder 
30 kinetics). 

Figure 2 shows an alignment of the sequence of a polypeptide of the invention (SEQ 
ID NO: 13) with the following human interferon-alpha (huIFN-alpha) polypeptide 
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sequences: huIFN-alpha la (SEQ ID NO:45), huIFN-alpha 2b (SEQ ID NO:46), huIFN- 
alpha 4b (SEQ ID NO:48), huIFN-alpha 5 (SEQ ID NO:49), huIFN-alpha 6 (SEQ ID 
NO:50), huIFN-alpha 7 (SEQ ID NO:51), huIFN-alpha 8b (SEQ ID NO:52), huIFN-alpha 
10a (SEQ ID NO:53), huIFN-alpha 14a (SEQ ID NO:54), huIFN-alpha 16 (SEQ ID 
NO:55), huIFN-alpha 17b (SEQ ID NO:56) and huIFN-alpha 21b (SEQ ID NO:57). The 
naming conventions for the huIFN-alpha sequences are according to Allen G. and Diaz 
M.O. (1996) J. Interferon and Cytokine Res. 16:181-184. Amino acid residue positions in 
SEQ ID NOs:45-46 and 48-57 which are identical to SEQ ID NO: 13 are indicated with a 
period (.), and gaps in the sequence are indicated with a dash (-). 

Figure 3 shows an alignment of the sequence of a polypeptide of the invention (SEQ 
ID NO:38) with human interferon-alpha polypeptide sequences SEQ ID NOs:45-46 and 
48-57. Amino acid residue positions in SEQ ID NOs:45-46 and 48-57 which are identical 
to SEQ ID NO: 38 are indicated with a period (.), and gaps in the sequence are indicated 
with a dash (-). 

Figure 4 shows the BLOSUM62 substitution matrix. 

Figures 5A, 5B and 5C show examples of calculations of alignment scores used to 
determine optimal sequence alignments, using the following parameters: BLOSUM62 
matrix, gap open penalty =11, and gap extension penalty = 1. 

Figures 6 and 7 show E. coli expression vectors designated OripBR-URA3- 
Iacl_14epil8-mut2 and OripBR-URA3-lacI_25epil9-mut, respectively. 

Figure 8 shows the separation via cation exchange HPLC of positional isomers of an 
exemplary PEGylated interferon-alpha conjugate. 

Figure 9 shows an SDS_PAGE gel of fractions from the cation exchange HPLC 
separation of Figure 8. 

DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

Unless otherwise defined herein or in the remainder of the specification, all technical 
and scientific terms used herein have the same meaning as commonly understood by 
those of ordinary skill in the art to which the invention belongs. 
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A "polypeptide sequence" (e.g., a protein, polypeptide, peptide, etc.) is a polymer of 
amino acids comprising naturally occurring amino acids or artificial amino acid 
analogues, or a character string representing an amino acid polymer, depending on 
context. Given the degeneracy of the genetic code, one or more nucleic acids, or the 

5 complementary nucleic acids thereof, that encode a specific polypeptide sequence can be 
determined from the polypeptide sequence. 

A "polynucleotide sequence" (e.g., a nucleic acid, polynucleotide, oligonucleotide, 
etc.) is a polymer of nucleotides comprising nucleotides A,C,T,U,G, or other naturally 
occurring nucleotides or artificial nucleotide analogues, or a character string representing 

10 a nucleic acid, depending on context. Either the given nucleic acid or the complementary 
nucleic acid can be determined from any specified polynucleotide sequence. 

Numbering of a given amino acid polymer or nucleic acid polymer "corresponds to" 
or is "relative to" the numbering of a selected amino acid polymer or nucleic acid 
polymer when the position of any given polymer component (e.g., amino acid, nucleotide, 

15 also referred to generically as a "residue") is designated by reference to the same or an 
equivalent position in the selected amino acid or nucleic acid polymer, rather than by the 
actual numerical position of the component in the given polymer. Thus, for example, the 
numbering of a given amino acid position in a given polypeptide sequence corresponds to 
the same or equivalent amino acid position in a selected polypeptide sequence used as a 

20 reference sequence. 

An "equivalent position" (for example, an "equivalent amino acid position" or 
"equivalent nucleic acid position" or "equivalent residue position") is defined herein as a 
position (such as, an amino acid position or nucleic acid position or residue position) of a 
test polypeptide (or test polynucleotide) sequence which aligns with a corresponding 

25 position of a reference polypeptide (or reference polynucleotide) sequence, when 

optimally aligned using an alignment algorithm as described herein. The equivalent 
amino acid position of the test polypeptide need not have the same numerical position 
number as the corresponding position of the reference polypeptide; likewise, the 
equivalent nucleic acid position of the test polynucleotide need not have the same 

30 numerical position number as the corresponding position of the reference polynucleotide. 
As an example, Figure 2 shows the sequence of a polypeptide of the invention (SEQ ID 
NO: 13) optimally aligned with various known human interferon-alpha polypeptide 



9 



WO 2005/113592 



PCT/US2005/0 17471 



sequences. In this example, amino acid position number 48 of SEQ ID NO: 13 is 
considered to be an equivalent amino acid position to (i.e. is "equivalent to") that of 
amino acid position number 47 of SEQ ID NO:46 (huIFN-alpha 2b), since amino acid 
number 48 of SEQ ID NO: 13 aligns with amino acid number 47 of SEQ ID NO:46. In 
5 other words, amino acid position 48 of SEQ ID NO: 13 "corresponds to" amino acid 
position 47 of SEQ ID NO:46. Likewise, residue A48 (Ala48) in SEQ ID NO: 13 is 
understood to correspond to residue F48 (Phe48) in, e.g., SEQ ID NO:l, so that for 
example the substitution A48L relative to SEQ ID NO: 13 is understood to correspond to 
the substitution F48L relative to, e.g., SEQ ID NO:l (and so on). 

10 Two polypeptide sequences are "optimally aligned". when they are aligned using 

defined parameters, i.e., a defined amino acid substitution matrix, gap existence penalty 
(also termed gap open penalty), and gap extension penalty, so as to arrive at the highest 
similarity score possible for that pair of sequences. The BLOSUM62 matrix (Henikoff 
and Henikoff (1992) Proc. Natl.'Acad. Sci. USA 89(22): 10915-10919) is often used as a 

15 default scoring substitution matrix in polypeptide sequence alignment algorithms (such as 
BLASTP). The gap existence penalty is imposed for the introduction of a single amino 
acid gap in one of the aligned sequences, and the gap extension penalty is imposed for 
each residue position in the gap. Unless otherwise stated, alignment parameters 
employed herein are: BLOSUM62 scoring matrix, gap existence penalty =11, and gap 

20 extension penalty = 1. The alignment score is defined by the amino acid positions of each 
sequence at which the alignment begins and ends (e.g. the alignment window), and 
optionally by the insertion of a gap or multiple gaps into one or both sequences, so as to 
arrive at the highest possible similarity score, as described in more detail below in the 
section entitled "Percent Sequence Identity", 

25 The terminology used for identifying amino acid positions and amino acid 

substitutions is illustrated as follows: F48 indicates position number 48 is occupied by a 
phenylalanine (Phe) residue in a reference amino acid sequence, such as SEQ ID NO: 1. 
F48A indicates that the phenylalanine residue of position 48 has been substituted with a 
alanine (Ala) residue. Alternative substitutions are indicated with a "/", e.g., F48A/L 

30 means an amino acid sequence in which the phenylalanine residue at position 48 is 

substituted with an alanine or a leucine residue. Multiple substitutions may sometimes be 
indicated with a "+", e.g. H47Q+ F48A/L means an amino acid sequence which 
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comprises a substitution of the histidine residue at position 47 with an glutamine residue 
and a substitution of the phenylalanine residue in position 48 with an alanine or a leucine 
residue Deletions are indicated by an asterix. For example, H47* indicates that the 
histidine residue in position 47 has been deleted. Deletions of two or more continuous 
5 amino acids may be indicated as follows, e.g., R161*-E166* indicates the deletion of 
residues R161-E166 inclusive (that is, residues 161, 162, 163, 164, 164, and 166 are 
deleted). Insertions are indicated the following way: Insertion of an additional serine 
residue after the histidine residue located at position 47 is indicated as H47HS. Combined 
substitutions and insertions are indicated in the following way: Substitution of the 

1 0 histidine residue at position 47 with a serine residue and insertion of an alanine residue 
after the position 47 amino acid residue is indicated as H47SA. 

Unless otherwise indicated, the position numbering of amino acid residues recited 
herein is relative to the amino acid sequence SEQ ID NO:l. It is to be understood that 
while the examples and modifications to the parent polypeptide are generally provided 

15 herein relative to the sequence SEQ ID NO: 1 (or relative to another specified sequence), 
the examples pertain to other polypeptides of the invention, and the modifications 
described herein may be made in equivalent amino acid positions of any of the other 
polypeptides described herein. Thus, as an example, the substitution F48L relative to 
SEQ ID NO:l is understood to correspond to the substitution A48L in SEQ ID NO: 13, 

20 and so on. 

The term "exhibiting (e.g., exhibits, or having, or has) an interferon-alpha activity" is 
intended to indicate that the polypeptide or conjugate of the invention has at least one 
activity exhibited by a reference interferon-alpha polypeptide (such as, for example, a 
human interferon-alpha polypeptide, e.g., huIFN-alpha 2b identified herein as SEQ ID 

25 NO:46, huIFN-alpha 2a identified herein as SEQ ID NO:47, hlFN-alpha 8b identified 

herein as SEQ ID NO:5 1, or any other interferon alpha polypeptide known in the art, such 
as, for example, those listed in Allen G. and Diaz M.O. (1996, supra)). Such activity 
includes the ability to signal through an interferon-alpha receptor, as evidenced by, for 
example, one or more of: inhibition of viral replication in virus-infected cells ("antiviral 

30 activity"); enhancement of differentiation of naive T-cells to a T H 1 phenotype and/or 
suppression of differentiation of naive T-cells to a Th2 phenotype ("T H 1 differentiation 
activity"); or inhibition of cell proliferation ("antiproliferative activity"). The one or 
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more interferon-alpha activity is assayed using assays known in the art and/or described 
in the Examples. 

A polypeptide or a conjugate exhibiting an interferon-alpha activity is considered to 
have such activity when it displays a measurable activity, e.g., a measurable antiviral 

5 activity, antiproliferative activity, or T H 1 differentiation activity (e.g., as determined by 
assays known in the art and/or described in the Examples). One of skill in the art 
recognizes that what constitutes a measurable activity depends in part on the nature of the 
assay being undertaken, but as a general guideline a measurable activity is one in which 
the assay signal generated in the presence of the test compound (e.g., a polypeptide of the 

10 invention) is quantifiably different than the assay signal generated in the absence of the 
test compound. It is to be understood that a polypeptide or conjugate of the invention 
need not exhibit all of the known activities of a particular reference interferon-alpha, or 
exhibit such activities to the same extent as the reference interferon-alpha. In some 
instances the activity exhibited by a polypeptide or conjugate of the invention (as 

15 evidenced, e.g., by an EC 50 , specific activity, or other value related to activity) may be 
about equal to, be less than, or be greater than that of the particular activity exhibited by 
the reference interferon-alpha. 

A "variant" is a polypeptide comprising a sequence which differs in one or more 
amino acid position(s) from that of a parent polypeptide sequence. For example, a variant 

20 may comprise a sequence which differs from the parent polypeptides sequence in up to 
10% of the total number of residues of the parent polypeptide sequence, such as in up to 
8% of the residues, e.g., in up to 6%, 5%, 4%, 3% 2% or 1% of the total number of 
residue of the parent polypeptide sequence. For example, a variant of SEQ ID NO: 1 may 
comprise a sequence which differs from SEQ ID NO: 1 in 1-16 amino acid positions (such 

25 as in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid positions), e.g. in 1-15 
amino acid positions, in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 
amino acid positions, in 1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 
amino acid positions, in 1-8 amino acid positions, in 1-7 amino acid positions, in 1-6 
amino acid positions, in 1-5 amino acid positions, in 1-4 amino acid positions, in 1-3 

30 amino acid positions, or in 1-2 amino acid positions. 

The term "parent polypeptide" or "parent interferon-alpha" is intended to indicate 
the polypeptide sequence to be modified in accordance with the present invention. The 
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parent polypeptide sequence may be that of a naturally occurring IFN-alpha (such as a 
mammalian IFN-alpha, e.g., a primate IFN-alpha, such as a human IFN-alpha, such as a 
huIFN-alpha polypeptide identified herein as SEQ ID NOs:45-57, or other huIFN-alpha 
sequence such as those described herein and/or in Allen G. and Diaz M.O. (1996), supra). 
5 The parent polypeptide sequence may be that of a non-naturally occurring (i.e., 
"synthetic") interferon-alpha, such as IFN-alpha Conl (SEQ ID NO:58). In some 
instances, the parent polypeptide to be modified may itself be a polypeptide of the 
invention, such as, for example, any one of SEQ ID NOs:2-35 and SEQ ID NOs:37-44. 
"Naturally occurring" as applied to an object refers to the fact that the object can be 

10 found in nature as distinct from being artificially produced by marl. For example, a 

polypeptide or polynucleotide sequence that is present in an organism (including viruses, 
bacteria, protozoa, insects, plants or mammalian tissue) that can be isolated from a source 
in nature and which has not been intentionally modified by man in the laboratory is 
naturally occurring. "Non-naturally occurring" (also termed "synthetic" or "artificial") as 

15 applied to an object means that the object is not naturally-occurring — i.e., the object 
cannot be found in nature as distinct from being artificially produced by man. 

A "fragment" or "subsequence" is any portion of an entire sequence, up to but not 
including the entire sequence. Thus, a fragment or subsequence refers to a sequence of 
amino acids or nucleic acids that comprises a part of a longer sequence of amino acids 

20 (e.g., polypeptide) or nucleic acids (e.g., polynucleotide). 

One type of fragment contemplated by the present invention is a fragment in which 
amino acid residues are removed from the N-terminus or the C-terminus of the parent 
polypeptide (or both); such a polypeptide is considered to be "N-terminally truncated" or 
"C-terminally truncated", respectively. It is known that deletion of at least the first four 

25 amino acids from the N-terminus does not significantly affect interferon-alpha activity 
(Lydon, N.B. et al (1985) Biochemistry 24: 4131-41). Furthermore, variants retaining 
interferon-alpha activity have been described wherein between 7 and 1 1 amino acids have 
been deleted from the C-terminus (Cheetham B.F. et al (1991) Antiviral Res. 15(1):27- 
39; Chang NT. et al (1983) Arch. Biochem Biophys. 221(2): 585-589; Franke A.E. et 

30 al (1982) DNA l(3):223-230). 

A "receptor" e.g., an "interferon-alpha receptor" (also known as a "Type I interferon 
receptor") is a receptor which is activated in cells by an interferon-alpha, e.g., binds an 
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interferon-alpha and initiates intracellular signaling, such as a type I interferon receptor 
comprising receptor subunits IFNAR-2 and EFNAR-1 (Domanski et al. (1998) J. Biol. 
Chem. 273(6):3 144-3 147; Mogensen et aL, (1999) Journal of Interferon and Cytokine 
Research, 19: 1069-1098). In the context of this invention, receptor is also meant to 
5 include truncated forms of a full-length receptor molecule, such as for example a receptor 
molecule which lacks a membrane-binding portion, such as a soluble form of a receptor 
molecule (also known as a "soluble receptor") which comprises an extracelluar binding 
domain, which binds an interferon-alpha, but may not necessarily bind to a membrane 
and/or initiate intracellular signaling. 

10 A "specific binding affinity" between two molecules, e.g., a ligand and a receptor, 

means a preferential binding of one molecule for another in a mixture of molecules. The 
binding of the molecules is typically considered specific if the binding affinity is about 1 
x 10 4 M' 1 to about 1 x 10 9 M" 1 or greater (i.e., K D of about 10" 4 to 10" 9 M or less). 
Binding affinity of a ligand and a receptor may be measured by standard techniques 

15 known to those of skill in the art. Non-limiting examples of well-known techniques for 
measuring binding affinities include Biacore® technology (Biacore AB, Sweden), 
isothermal titration microcalorimetry (MicroCal LLC, Northampton, MA USA), ELISA, 
and FACS. For example, FACS or other sorting methods may be used to select for 
populations of molecules (such as for example, cell surface-displayed ligands) which 

20 specifically bind to the associated binding pair member (such as a receptor, e.g., a soluble 
receptor). Ligand-receptor complexes may be detected and sorted e.g., by fluorescence 
(e.g., by reacting the complex with a fluorescent antibody that recognizes the complex). 
Molecules of interest which bind an associated binding pair member (e.g., receptor) are 
pooled and re-sorted in the presence of lower concentrations of receptor. By performing 

25 multiple rounds sorting in the presence of decreasing concentrations of receptor (an 
exemplary concentration range being on the order of 10' 6 M down to 10" 9 M, i.e., 1 
micromolar (|oM) down to 1 nanomolar (nM), or less, depending on the nature of the 
ligand-receptor interaction), populations of the molecule of interest exhibiting specific 
binding affinity for the receptor may be isolated. 

30 A polypeptide, nucleic acid, or other component is "isolated" when it is partially or 

completely separated from components with which it is normally associated (other 
peptides, polypeptides, proteins (including complexes, e.g., polymerases and ribosomes 
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which may accompany a native sequence), nucleic acids, cells, synthetic reagents, cellular 
contaminants, cellular components, etc.), e.g., such as from other components with which 
it is normally associated in the cell from which it was originally derived. A polypeptide, 
nucleic acid, or other component is isolated when it is partially or completely recovered 
5 or separated from other components of its natural environment such that it is the 

predominant species present in a composition, mixture, or collection of components (i.e., 
on a molar basis it is more abundant than any other individual species in the 
composition). In some instances, the preparation consists of more than about 60%, 70% 
or 75%, typically more than about 80%, or preferably more than about 90% of the 

10 isolated species. 

A "substantially pure" or "isolated" nucleic acid (e.g., RNA or DNA), polypeptide, 
protein, or composition also means where the object species (e.g., nucleic acid or 
polypeptide) comprises at least about 50, 60, or 70 percent by weight (on a molar basis) 
of all macromolecular species present. A substantially pure or isolated composition can 

15 also comprise at least about 80, 90, or 95 percent by weight of all macromolecular species 
present in the composition. An isolated object species can also be purified to essential 
homogeneity (contaminant species cannot be detected in the composition by conventional 
detection methods) wherein the composition consists essentially of derivatives of a single 
macromolecular species. The term "purified" generally denotes that a nucleic acid, 

20 polypeptide, or protein gives rise to essentially one band in an electrophoretic gel. It 
typically means that the nucleic acid, polypeptide, or protein is at least about 50% pure, 
60% pure, 70% pure, 75% pure, more preferably at least about 85% pure, and most 
preferably at least about 99% pure. 

The term "isolated nucleic acid" may refer to a nucleic acid (e.g., DNA or RNA) that 

25 is not immediately contiguous with both of the coding sequences with which it is 

immediately contiguous (i.e., one at the 5' and one at the 3' end) in the naturally occurring 
genome of the organism from which the nucleic acid of the invention is derived. Thus, 
this term includes, e.g., a cDNA or a genomic DNA fragment produced by polymerase 
chain reaction (PCR) or restriction endonuclease treatment, whether such cDNA or 

30 genomic DNA fragment is incorporated into a vector, integrated into the genome of the 
same or a different species than the organism, including, e.g., a virus, from which it was 
originally derived, linked to an additional coding sequence to form a hybrid gene 
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encoding a chimeric polypeptide, or independent of any other DNA sequences. The DNA 
may be double-stranded or single-stranded, sense or antisense. 

A "recombinant polynucleotide" or a "recombinant polypeptide" is a non-naturally 
occurring polynucleotide or polypeptide which may include nucleic acid or amino acid 

5 sequences, respectively, from more than one source nucleic acid or polypeptide, which 
source nucleic acid or polypeptide can be a naturally occurring nucleic acid or 
polypeptide, or can itself have been subjected to mutagenesis or other type of 
modification. A nucleic acid or polypeptide may be deemed "recombinant" when it is 
synthetic or artificial or engineered, or derived from a synthetic or artificial or engineered 

10 polypeptide or nucleic acid. A recombinant nucleic acid (e.g., DNA or RNA) can be 
made by the combination (e.g., artificial combination) of at least two segments of 
sequence that are not typically included together, not typically associated with one 
another, or are otherwise typically separated from one another. A recombinant nucleic 
acid can comprise a nucleic acid molecule formed by the joining together or combination 

15 of nucleic acid segments from different sources and/or artificially synthesized. A 

"recombinant polypeptide" often refers to a polypeptide that results from a cloned or 
recombinant nucleic acid. The source polynucleotides or polypeptides from which the 
different nucleic acid or amino acid sequences are derived are sometimes homologous 
(i.e., have, or encode a polypeptide that encodes, the same or a similar structure and/or 

20 function), and are often from different isolates, serotypes, strains, species, of organism or 
from different disease states, for example. 

The term "recombinant" when used with reference, e.g., to a cell, polynucleotide, 
vector, protein, or polypeptide typically indicates that the cell, polynucleotide, or vector 
has been modified by the introduction of a heterologous (or foreign) nucleic acid or the 

25 alteration of a native nucleic acid, or that the protein or polypeptide has been modified by 
the introduction of a heterologous amino acid, or that the cell is derived from a cell so 
modified. Recombinant cells express nucleic acid sequences that are not found in the 
native (non-recombinant) form of the cell or express native nucleic acid sequences that 
would otherwise be abnormally expressed, under-expressed, or not expressed at all. The 

30 term "recombinant" when used with reference to a cell indicates that the cell replicates a 
heterologous nucleic acid, or expresses a polypeptide encoded by a heterologous nucleic 
acid. Recombinant cells can contain coding sequences that are not found within the 
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native (non-recombinant) form of the cell. Recombinant cells can also contain coding 
sequences found in the native form of the cell wherein the coding sequences are modified 
and re-introduced into the cell by artificial means. The term also encompasses cells that 
contain a nucleic acid endogenous to the cell that has been modified without removing the 
5 nucleic acid from the cell; such modifications include those obtained by gene 
replacement, site-specific mutation, recombination, and related techniques. 

The term "recombinantly produced" refers to an artificial combination usually 
accomplished by either chemical synthesis means, recursive sequence recombination of 
nucleic acid segments or other diversity generation methods (such as, e.g., shuffling) of 

10 nucleotides, or manipulation of isolated segments of nucleic acids, e.g., by genetic 
engineering techniques known to those of ordinary skill in the art. "Recombinantly 
expressed" typically refers to techniques for the production of a recombinant nucleic acid 
in vitro and transfer of the recombinant nucleic acid into cells in vivo, in vitro, or ex vivo 
where it may be expressed or propagated. 

15 A "recombinant expression cassette" or simply an "expression cassette" is a nucleic 

acid construct, generated recombinantly or synthetically, with nucleic acid elements that 
are capable of effecting expression of a structural gene in hosts compatible with such 
sequences. Expression cassettes include at least promoters and optionally, transcription 
termination signals. Typically, the recombinant expression cassette includes a nucleic 

20 acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide), and a 

promoter. Additional factors necessary or helpful in effecting expression may also be 
used as described herein. For example, an expression cassette can also include nucleotide 
sequences that encode a signal sequence that directs secretion of an expressed protein 
from the host cell. Transcription termination signals, enhancers, and other nucleic acid 

25 sequences that influence gene expression, can also be included in an expression cassette. 

An "immunogen" refers to a substance capable of provoking an immune response, 
and includes, e.g., antigens, autoantigens that play a role in induction of autoimmune 
diseases, and tumor-associated antigens expressed on cancer cells. An immune response 
generally refers to the development of a cellular or antibody-mediated response to an 

30 agent, such as an antigen or fragment thereof or nucleic acid encoding such agent. In 
some instances, such a response comprises a production of at least one or a combination 
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of CTLs, B cells, or various classes of T cells that are directed specifically to antigen- 
presenting cells expressing the antigen of interest. 

An "antigen" refers to a substance that is capable of eliciting the formation of 
antibodies in a host or generating a specific population of lymphocytes reactive with that 
5 substance. Antigens are typically macromolecules (e.g., proteins and polysaccharides) 
that are foreign to the host. 

An "adjuvant" refers to a substance that enhances an antigen's immune-stimulating 
properties or the pharmacological effect(s) of a drug. An adjuvant may non-specifically 
enhance the immune response to an antigen. "Freund's Complete Adjuvant," for 
10 example, is an emulsion of oil and water containing an immunogen, an emulsifying agent 
and mycobacteria. Another example, "Freund's incomplete adjuvant," is the same, but 
without mycobacteria. 

A vector is a component or composition for facilitating cell transduction or 
transfection by a selected nucleic acid, or expression of the nucleic acid in the cell. 
15 Vectors include, e.g., plasmids, cosmids, viruses, YACs, bacteria, poly-lysine, etc. An 
"expression vector" is a nucleic acid construct or sequence, generated recombinantly or 
synthetically, with a series of specific nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, 
virus, or nucleic acid fragment. The expression vector typically includes a nucleic acid to 
20 be transcribed operably linked to a promoter. The nucleic acid to be transcribed is 
typically under the direction or control of the promoter. 

"Substantially the entire length of a polynucleotide sequence" or "substantially the 
entire length of a polypeptide sequence" refers to at least 50%, generally at least about 
60%, 70%, or 75%, usually at least about 80%, or typically at least about 85%, 90%, 
25 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more of a length of a 
polynucleotide sequence or polypeptide sequence. 

The term "immunoassay" includes an assay that uses an antibody or immunogen to 
bind or specifically bind an antigen. The immunoassay is typically characterized by the 
use of specific binding properties of a particular antibody to isolate, target, and/or 
30 quantify the antigen. 

The term "subject" as used herein includes, but is not limited to, an organism; a 
mammal, including, e.g., a human, non-human primate (e.g., baboon, orangutan, 
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monkey), mouse, pig, cow, goat, cat, rabbit, rat, guinea pig, hamster, horse, monkey, 
sheep, or other non-human mammal; a non-mammal, including, e.g., a non-mammalian 
vertebrate, such as a bird (e.g., a chicken or duck) or a fish, and a non-mammalian 
invertebrate. 

5 The term "pharmaceutical composition" means a composition suitable for 

pharmaceutical use in a subject, including an animal or human. A pharmaceutical 
composition generally comprises an effective amount of an active agent and a carrier, 
including, e.g., a pharmaceutically acceptable carrier. 

The term "effective amount" means a dosage or amount sufficient to produce a 

10 desired result. The desired result may comprise an objective or subjective improvement 
in the recipient of the dosage or amount. 

A "prophylactic treatment" is a treatment administered to a subject who does not 
display signs or symptoms of a disease, pathology, or medical disorder, or displays only 
early signs or symptoms of a disease, pathology, or disorder, such that treatment is 

15 administered for the purpose of diminishing, preventing, or decreasing the risk of 
developing the disease, pathology, or medical disorder. A prophylactic treatment 
functions as a preventative treatment against a disease or disorder. A "prophylactic 
activity" is an activity of an agent, such as a nucleic acid, vector, gene, polypeptide, 
protein, substance, or composition thereof that, when administered to a subject who does 

20 not display signs or symptoms of pathology, disease or disorder, or who displays only 
early signs or symptoms of pathology, disease, or disorder, diminishes, prevents, or 
decreases the risk of the subject developing a pathology, disease, or disorder. A 
"prophylactically useful" agent or compound (e.g., nucleic acid or polypeptide) refers to 
an agent or compound that is useful in diminishing, preventing, treating, or decreasing 

25 development of pathology, disease or disorder. 

A "therapeutic treatment" is a treatment administered to a subject who displays 
symptoms or signs of pathology, disease, or disorder, in which treatment is administered 
to the subject for the purpose of diminishing or eliminating those signs or symptoms of 
pathology, disease, or disorder. A "therapeutic activity" is an activity of an agent, such as 

30 a nucleic acid, vector, gene, polypeptide, protein, substance, or composition thereof, that 
eliminates or diminishes signs or symptoms of pathology, disease or disorder, when 
administered to a subject suffering from such signs or symptoms. A "therapeutically 
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useful" agent or compound (e.g., nucleic acid or polypeptide) indicates that an agent or 
compound is useful in diminishing, treating, or eliminating such signs or symptoms of a 
pathology, disease or disorder. 

The term "gene" broadly refers to any segment of DNA associated with a biological 

5 function. Genes include coding sequences and/or regulatory sequences required for their 
expression. Genes also include non-expressed DNA nucleic acid segments that, e.g., 
form recognition sequences for other proteins (e.g., promoter, enhancer, or other 
regulatory regions). Genes can be obtained from a variety of sources, including cloning 
from a source of interest or synthesizing from known or predicted sequence information, 

1 0 and may include sequences designed to have desired parameters. 

Generally, the nomenclature used hereafter and the laboratory procedures in cell 
culture, molecular genetics, molecular biology, nucleic acid chemistry, and protein 
chemistry described below are those well known and commonly employed by those of 
ordinary skill in the art. Standard techniques, such as described in Sambrook et al., 

15 Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor 

Laboratory, Cold Spring Harbor, New York, 1989 (hereinafter "Sambrook") and Current 
Protocols in Molecular Biology, F. M. Ausubel et al., eds., Current Protocols, a joint 
venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994, 
supplemented through 1999) (hereinafter "Ausubel"), are used for recombinant nucleic 

20 acid methods, nucleic acid synthesis, cell culture methods, and transgene incorporation, 
e.g., electroporation, injection, gene gun, impressing through the skin, and lipofection. 
Generally, oligonucleotide synthesis and purification steps are performed according to 
specifications. The techniques and procedures are generally performed according to 
conventional methods in the art and various general references which are provided 

25 throughout this document. The procedures therein are believed to be well known to those 
of ordinary skill in the art and are provided for the convenience of the reader. 

As used herein, an "antibody" refers to a protein comprising one or more 
polypeptides substantially or partially encoded by immunoglobulin genes or fragments of 
immunoglobulin genes. The term antibody is used to mean whole antibodies and binding 

30 fragments thereof. The recognized immunoglobulin genes include the kappa, lambda, 
alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad 
immunoglobulin variable region genes. Light chains are classified as either kappa or 
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lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn 
define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. A typical 
immunoglobulin (e.g., antibody) structural unit comprises a tetramer. Each tetramer is 
composed of two identical pairs of polypeptide chains, each pair having one "light" 
5 (about 25 KDa) and one "heavy" chain (about 50-70 KDa). The N-terminus of each chain 
defines a variable region of about 100 to 1 10 or more amino acids primarily responsible 
for antigen recognition. The terms variable light chain (VL) and variable heavy chain 
(VH) refer to these light and heavy chains, respectively. 

Antibodies also include single-armed composite monoclonal antibodies, single chain 

10 antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a 
variable light chain are joined together (directly or through a peptide linker) to form a 
continuous polypeptide, as well as diabodies, tribodies, and tetrabodies (Pack et al. (1995) 
J Mol Biol 246:28; Biotechnol 1 1 : 127 1 ; and Biochemistry 3 1 : 1579). The antibodies are, 
e.g., polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments, 

15 fragments produced by an Fab expression library, or the like. 

The term "epitope" means a protein determinant capable of specific binding to an 
antibody. Epitopes usually consist of chemically active surface groupings of molecules 
such as amino acids or sugar side chains and usually have specific three dimensional 
structural characteristics, as well as specific charge characteristics. Conformational and 

20 nonconformational epitopes are distinguished in that the binding to the former but not the 
latter is lost in the presence of denaturing solvents. 

An "antigen-binding fragment" of an antibody is a peptide or polypeptide fragment 
of the antibody that binds an antigen. An antigen-binding site is formed by those amino 
acids of the antibody that contribute to, are involved in, or affect the binding of the 

25 antigen. See Scott, T.A. and Mercer, E.L, Concise Encyclopedia: Biochemistry and 

Molecular Biology (de Gruyter, 3d ed. 1997), and Watson, J.D. et al., Recombinant DNA 
(2d ed. 1992) (hereinafter "Watson, Recombinant DNA"), each of which is incorporated 
herein by reference in its entirety for all purposes. 

The term "screening" describes, in general, a process that identifies optimal 

30 molecules of the present invention, such as, e.g., polypeptides of the invention, and 
related fusion polypeptides including the same, and nucleic acids encoding all such 
molecules. Several properties of these respective molecules can be used in selection and 
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screening, for example: an ability of a respective molecule to bind a ligand or to a 
receptor, to inhibit cell proliferation, to inhibit viral replication in virus-infected cells, to 
induce or inhibit cellular cytokine production, to alter an immune response, e.g., induce or 
inhibit a desired immune response, in a test system or an in vitro, ex vivo or in vivo 
5 application. In the case of antigens, several properties of the antigen can be used in 

selection and screening including antigen expression, folding, stability, immunogenicity 
and presence of epitopes from several related antigens. 

"Selection" is a form of screening in which identification and physical separation are 
achieved simultaneously by, e.g., expression of a selection marker, which, in some 

10 genetic circumstances, allows cells expressing the marker to survive while other cells die 
(or vice versa). Screening markers include, for example, luciferase, beta-galactosidase 
and green fluorescent protein, and the like. Selection markers include drug and toxin 
resistance genes, and the like. Another mode of selection involves physical sorting based 
on a detectable event, such as binding of a ligand to a receptor, reaction of a substrate 

15 with an enzyme, or any other physical process which can generate a detectable signal 

either directly (e.g., by utilizing a chromogenic substrate or ligand) or indirectly (e.g., by 
reacting with a chromogenic secondary antibody). Selection by physical sorting can by 
accomplished by a variety of methods, such as by FACS in whole cell or microdroplet 
formats. 

20 An "exogenous" nucleic acid," "exogenous DNA segment," "heterologous 

sequence," or "heterologous nucleic acid," as used herein, is one that originates from a 
source foreign to the particular host cell, or, if from the same source, is modified from its 
original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous 
to the particular host cell, but has been modified. Modification of a heterologous 

25 sequence in the applications described herein typically occurs through the use of recursive 
sequence recombination. The terms refer to a DNA segment which is foreign or 
heterologous to the cell, or homologous to the cell but in a position within the host cell 
nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are 
expressed to yield exogenous polypeptides. 

30 The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and 

polymers thereof in either single- or double-stranded form. Unless specifically limited, 
the term encompasses nucleic acids containing known analogues of natural nucleotides 
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which have similar binding properties as the reference nucleic acid and are metabolized in 
a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a 
particular nucleic acid sequence also implicitly encompasses conservatively modified 
variants thereof (e.g., degenerate codon substitutions) and complementary sequences and 
5 as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions 
may be achieved by generating sequences in which the third position of one or more 
selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
(Batzer et al. (1991) Nucleic Acid Res 19:5081; Ohtsuka et al. (1985) J Biol Chem 
260:2605-2608; Cassol et al. (1992) ; Rossolini et al. (1994) Mol Cell Probes 8:91-98). 
10 The term nucleic acid is used interchangeably with gene, cDNA, and mRNA encoded by 
a gene. 

"Nucleic acid derived from a gene" refers to a nucleic acid for whose synthesis the 
gene, or a subsequence thereof, has ultimately served as a template. Thus, an mRNA, a 
cDNA reverse transcribed from an mRNA, an RNA transcribed from that cDNA, a DNA 

15 amplified from the cDNA, an RNA transcribed from the amplified DNA, etc, are all 

derived from the gene and detection of such derived products is indicative of the presence 
and/or abundance of the original gene and/or gene transcript in a sample. 
A nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For instance, a promoter or enhancer is operably linked to 

20 a coding sequence if it increases the transcription of the coding sequence. Operably linked 
means that the DNA sequences being linked are typically contiguous and, where 
necessary to join two protein coding regions, contiguous and in reading frame. However, 
since enhancers generally function when separated from the promoter by several 
kilobases and intronic sequences may be of variable lengths, some polynucleotide 

25 elements may be operably linked but not contiguous. 

The term "cytokine" includes, for example, interleukins, interferons, chemokines, 
hematopoietic growth factors, tumor necrosis factors and transforming growth factors. In 
general these are low molecular weight proteins that regulate maturation, activation, 
proliferation, and differentiation of cells of the immune system. 

30 In the present description and claims, any reference to "a" component, e.g. in the context 
of a non-polypeptide moiety, an amino acid residue, a substitution, a buffer, a cation, etc., 
is intended to refer to one or more of such components, unless stated otherwise or unless 
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it is clear from the particular context that this is not the case. For example, the expression 
"a component selected from the group consisting of A, B and C" is intended to include all 
combinations of A, B and C, e.g., A, B, C, A+B, A+C, B+C or A+B+C 
Various additional terms are defined or otherwise characterized herein. 

MOLECULES AND METHODS OF THE INVENTION 

Molecules of the invention (e.g., polypeptides of the invention, conjugates of the 
invention, and nucleic acids encoding said polypeptides) are useful for the treatment of 
diseases and disorders which are responsive to treatment by interferon-alpha, particularly 
diseases associated with viral infection, such as, for example, infection by HCV. 

Patients with chronic HCV infection have viral loads typically in the range of 10 4 - 
10 7 copies of HCV RNA/ml of serum prior to treatment. Upon treatment with IFN-alpha, 
viral load in these patients characteristically undergoes two distinct log-linear phases of 
decline (Figure IB; Neumann A.U., et al (1998) Science 282:103-107). The initial rapid 
drop in viral load that occurs within the first two days of IFN-alpha therapy is believed to 
be due to interferon-alpha mediated reduction in virus production in the infected liver 
cells and concomitant protection of naive cells against infection. The rate of viral 
production reaches a new steady state at about two days, at which time a second less rapid 
log-linear phase of viral clearance is observed. This second phase of viral clearance is 
generally believed to be due in part to T-cell mediated killing of infected liver cells 
(Neumann, et aL, supra). IFN-alpha is believed to play a key role in this biological 
response through the stimulation of antigen specific T cells to differentiate into ThI cells. 
Furthermore, the mode of action of Ribavirin is believed to be due to augmentation of the 
Th 1 response, and is thought to be the mechanistic basis of its efficacy in combination 
therapy with IFN-alpha. HCV-infected patients who are non-responsive to interferon- 
alpha therapies currently in use (generally termed "non-responders") exhibit much 
shallower viral load clearance profiles (Figure 1 A). 

Although the present invention is not intended to be limited by a particular theory of 
underlying mechanism, it is proposed that antiviral activity in surrogate assay systems 
(such as those described in more detail herein) may be predictive of interferon-alpha 
efficacy, in for example the first phase of viral clearance. Exemplary antiviral assays, 
described in the Examples section, monitor the effectiveness of IFN-alpha in protecting 
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against the cytopathic effect of Encephalomyocarditis Virus (EMCV) in HuH7 human 
liver-derived cells as a surrogate system for effectiveness against HCV in human liver 
cells. Other useful virus / cell assay systems include EMCV in WISH human amnion- 
derived cells, EMCV in HeLa human cervical carcinoma cells, Vesicular Stomatitis 
Virus (VS V) in HuH7 cells, Vaccinia Virus (VV) in HeLa cells, Yellow Fever Virus 
(YFV) in HepG2 human hepatocarcinoma cells, as well as Human Immunodeficiency 
Virus (HIV) in primary CD4+ T-cells. Example 2 shows antiviral activities of 
representative polypeptides of the invention in the EMCV/HuH7 and EMCV/HeLa 
antiviral activity assays. 

Other surrogate assay systems useful for monitoring effectiveness against HCV in 
infected hepatocytes include HCV replicon systems, as described, for example, by 
Lohmann V., et al y (1999) Science 285(5424):285-293; Randall G. and Rice CM. (2001) 
Curr Opin Infect Dis 14(6):743-7477; and Bartenschlager, R. (2002) Nature 
Reviews/Drug Discovery 1:91 1. An example of a useful in vivo host system for 
monitoring HCV antiviral efficacy is a chimeric human liver SCID mouse, as described 
by Mercer, et al (2001) Nature Medicine 7(8):927-933. 

It is furthermore proposed, without being limited by theory, that enhancement of ThI 
differentiation and/or suppression of T H 2 differentiation by IFN-alpha may be a 
contributing factor to interferon-alpha efficacy, for example, in the second phase of viral 
clearance. According to this theory, evolved IFN-alphas with increased potency in these 
biological activities (i.e., enhancement of T H 1 differentiation and/or suppression of T H 2 
differentiation) would be predicted to have increased efficacy relative to, for example, 
currently approved therapeutic interferon-alpha molecules administered at the same 
dosage. An exemplary assay, described in the Examples section herein, monitors the 
enhancement of T H 1 differentiation and/or suppression of T H 2 differentiation by IFN- 
alpha on naive ThO cells, by measuring production of cytokines associated with the T H 1- 
phenotype (e.g., IFN-gamma) and/or the T H 2-phenotype (e.g., IL-5, IL4) via ELISA or 
via intracellular staining and FACS sorting. Example 3 shows ThI differentiation 
activities of representative polypeptides of the invention. 

The therapeutic efficacy of IFN-alpha molecules tends to be diminished in part due 
to dose-limiting toxicities, e.g. thrombocytopenia and neutropenia. Although the present 
invention is not intended to be limited by a particular theory of underlying mechanism, it 
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is proposed that such toxicity may be associated with anti-proliferative effects of IFN- 
alpha on platelet and neutrophil precursors, and that antiproliferative activity in surrogate 
assay systems (such as those described herein) may be predictive of the relative toxicity 
of an interferon-alpha molecule. Thus, dose-limiting toxicities associated with IFN- 
alpha therapy may be diminished in IFN-alpha molecules that exhibit reduced 
antiproliferative activity relative to, for example, currently approved therapeutic 
interferon-alpha molecules, such as ROFERON®-A (Interferon alfa-2a, recombinant; 
Hoffmann-La Roche Inc.), INTRON® A (Interferon alfa-2b, recombinant; Schering 
Corporation), and INFERGEN® (interferon alfacon-1; InterMune, Inc.). An exemplary 
antiproliferative activity assay, described in the Examples section herein, monitors the 
effect of IFN-alpha on the proliferation of human Daudi lymphoid cells. Alternatively, or 
in addition, dose-limiting toxicities may be reduced as a result of administering more 
therapeutically active molecules, which would permit dosing in lower concentrations or at 
lower frequency than currently approved molecules. Example 4 shows the Daudi 
antiproliferative activities of representative polypeptides of the invention. 

It is an object of the invention to provide novel interferon-alpha polypeptides, and 
nucleic acids which encode the polypeptides. Polypeptides of the invention are useful for 
the treatment of diseases and disorders which are responsive to treatment by interferon- 
alpha, particularly diseases associated with viral infection, such as, for example, infection 
by HCV. Some polypeptides of the invention exhibit an interferon-alpha activity, such 
as, for example, antiviral activity, antiproliferative activity, and/or ThI differentiation 
activity. Some polypeptides of the invention exhibit one or more of the following 
properties: increased or decreased antiviral activity compared to a reference IFN-alpha 
polypeptide; increased or decreased ThI differentiation activity compared to a reference 
IFN-alpha polypeptide; increased or decreased antiproliferative activity compared to a 
reference IFN-alpha polypeptide. The reference IFN-alpha polypeptide may comprise a 
sequence of a non-naturally occurring interferon-alpha, such as IFN-alpha Conl (SEQ ID 
NO:58), or may comprise a sequence of a naturally-occurring (i.e., wild-type) interferon- 
alpha polypeptide. Examples of sequences of naturally occurring interferon-alpha 
polypeptides include sequences of human IFN-alpha polypeptides, such as, for example, 
huIFN-alpha 2b (SEQ ID NO:46), huIFN-alpha 2a (SEQ ID NO:47), huIFN-alpha 2c 
(SEQ ID NO:46 with position 34 = Arg), huIFN-alpha 8b (SEQ ID NO:52), huIFN-alpha 
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8a (SEQ ID NO:52 with positions 98 = Val, 99 = Leu, 100 = Cys, and 101 = Asp), 
huIFN-alpha 8c (SEQ ID NO:52 with position 161 = Asp and amino acids at positions 
162-166 deleted), huIFN-alpha 14a (SEQ ID NO:54), huIFN-alpha 14c (SEQ ID NO:54 
with position 152 = Leu), or a sequence of any other naturally occurring human interferon 
alpha polypeptide, such as those listed in Allen G. and Diaz M.O. (1996), supra. 

In another aspect, the invention provides interferon-alpha polypeptides which exhibit 
enhanced efficacy in clearing a virus from virus-infected cells, compared to a reference 
interferon-alpha molecule, such as one currently employed as a therapeutic (such as, for 
example, ROFERON-A, INTRON A, or INFERGEN). Exemplary viruses include, but 
are not limited to, viruses of the Flaviviridae family, such as, for example, Hepatitis C 
Virus, Yellow Fever Virus, West Nile Virus, Japanese Encephalitis Virus, Dengue Virus, 
and Bovine Viral Diarrhea Virus; viruses of the Hepadnaviridae family, such as, for 
example, Hepatitis B Virus; viruses of the Picornaviridae family, such as, for example, 
Encephalomyocarditis Virus, Human Rhinovirus, and Hepatitis A Virus; viruses of the 
Retroviridae family, such as, for example, Human Immunodeficiency Virus, Simian 
Immunodeficiency Virus, Human T-Lymphotropic Virus, and Rous Sarcoma Virus; 
viruses of the Coronaviridae family, such as, for example, SARS coronavirus; viruses of 
the Rhabdoviridae family, such as, for example, Rabies Virus and Vesicular Stomatitis 
Virus, viruses of the Paramyxoviridae family, such as, for example, Respiratory Syncytial 
Virus and Parainfluenza Virus, viruses of the Papillomaviridae family, such as, for 
example, Human Papillomavirus, and viruses of the Herpesviridae family, such as, for 
example, Herpes Simplex Virus. Such enhanced efficacy may arise from enhanced 
antiviral activity, enhanced ThI -differentiation activity, or both, relative to the reference 
molecule. For example, some interferon-alpha polypeptides of the invention may be 
particularly useful in clearing viruses or viral strains that show poor response to treatment 
with interferon-alpha molecules currently in use, such as, for example, Genotype 1 of 
HCV. 

Some polypeptides of the invention exhibit an increased ratio of (antiviral 
activity/antiproliferative activity) compared to the reference IFN-alpha molecule, and/or 
an increased ratio of (T H 1 differentiation activity/antiproliferative activity) compared to 
the reference IFN-alpha molecule. Polypeptides exhibiting such properties may be 
particularly effective in treatment of viral infections, such as, for example, infection by a 
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virus listed above. Some such polypeptides may, for example, provide enhanced 
therapeutic efficacy over currently-approved interferon-alpha molecules in the treatment 
of HCV, in one or both phases of the biphasic viral clearance profile, and/or may exhibit 
reduced toxicity. Some such polypeptides may provide enhanced therapeutic efficacy 

5 over currently-approved interferon-alpha molecules in the treatment of Genotype 1 HCV. 
It is another object of the invention to provide conjugates, such conjugates 
comprising one or more non-polypeptide moiety linked to a polypeptide of the invention, 
which conjugate exhibits an interferon-alpha activity (such as one or more of the 
activities listed above), and which optionally exhibits other desirable properties, such as 

10 increased serum half-life and/or functional in vivo half-life, and/or decreased antigenicity, 
compared to the non-conjugated polypeptide. Some such conjugates may exhibit 
enhanced efficacy in clearing a virus from cells infected with the virus, compared to a 
reference interferon-alpha molecule, such as an interferon-alpha conjugate currently 
employed as a therapeutic (such as, for example, PEGAS YS® (Peginterferon alfa-2a; 

15 Hoffmann-La Roche, Inc.) or PEG-INTRON® (peginterferon alfa-2b; Schering 
Corporation). Exemplary viruses include, but are not limited to, viruses of the 
Flaviviridae family, such as, for example, Hepatitis C Virus, Yellow Fever Virus, West 
Nile Virus, Japanese Encephalitis Virus, Dengue Virus, and Bovine Viral Diarrhea Virus; 
viruses of the Hepadnaviridae family, such as, for example, Hepatitis B Virus; viruses of 

20 the Picornaviridae family, such as, for example, Encephalomyocarditis Virus, Human 
Rhinovirus, and Hepatitis A Virus; viruses of the Retroviridae family, such as, for 
example, Human Immunodeficiency Virus, Simian Immunodeficiency Virus, Human T- 
Lymphotropic Virus, and Rous Sarcoma Virus; viruses of the Coronaviridae family, such 
as, for example, SARS coronavirus; viruses of the Rhabdoviridae family, such as, for 

25 example, Rabies Virus and Vesicular Stomatitis Virus, viruses of the Paramyxoviridae 
family, such as, for example, Respiratory Syncytial Virus and Parainfluenza Virus, 
viruses of the Papillomaviridae family, such as, for example, Human Papillomavirus, and 
viruses of the Herpesviridae family, such as, for example, Herpes Simplex Virus. Such 
enhanced efficacy may arise from enhanced antiviral activity, enhanced T H 1- 

30 differentiation activity, or both, relative to the reference molecule. For example, some 

interferon-alpha conjugates of the invention may be particularly useful in clearing viruses 
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or viral strains that show poor response to treatment with interferon-alpha molecules 
currently in use, such as, for example, Genotype 1 of HCV. 

Some conjugates of the invention exhibit an increased ratio of (antiviral 
activity/antiproliferative activity) compared to the reference IFN-alpha molecule, and/or 
an increased ratio of (T H 1 differentiation activity/antiproliferative activity) compared to 
the reference IFN-alpha molecule. Conjugates exhibiting such properties may be 
particularly effective in treatment of viral infections, such as infection by a virus listed 
above, such as, for example, HCV. Some such conjugates may, for example, provide 
enhanced therapeutic efficacy over currently-approved interferon-alpha molecules in the 
treatment of HCV, in one or both phases of the biphasic viral clearance profile, and/or 
may exhibit reduced toxicity. Some such conjugates may provide enhanced therapeutic 
efficacy over currently-approved interferon-alpha molecules in the treatment of Genotype 
1 HCV. 

It is another object of the invention to provide a method of inhibiting viral 
replication in virus-infected cells, the method comprising administering to the virus- 
infected cells a polypeptide or conjugate of the invention in an amount effective to inhibit 
viral replication in said cells. The invention also provides a method of reducing the 
number of copies of a virus in virus-infected cells, comprising administering to the virus- 
infected cells a polypeptide or conjugate of the invention in an amount effective to reduce 
the number of copies of the virus in said cells. The virus may be a virus of the 
Flaviviridae family, such as, for example, Hepatitis C Virus, Yellow Fever Virus, West 
Nile Virus, Japanese Encephalitis Virus, Dengue Virus, or Bovine Viral Diarrhea Virus; 
a virus of the Hepadnaviridae family, such as, for example, Hepatitis B Virus; a virus of 
the Picornaviridae family, such as, for example, Encephalomyocarditis Virus, Human 
Rhinovirus, or Hepatitis A Virus; a virus of the Retroviridae family, such as, for example, 
Human Immunodeficiency Virus, Simian Immunodeficiency Virus, Human T- 
Lymphotropic Virus, or Rous Sarcoma Virus; a virus of the Coronaviridae family, such 
as, for example, SARS coronavirus; a virus of the Rhabdoviridae family, such as, for 
example, Rabies Virus or Vesicular Stomatitis Virus; a virus of the Paramyxoviridae 
family, such as, for example, Respiratory Syncytial Virus or Parainfluenza Virus; a virus 
of the Papillomaviridae family, such as, for example, Human Papillomavirus; or a virus 
of the Herpesviridae family, such as, for example, Herpes Simplex Virus. The virus may 
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for example be an RNA virus, such as HCV, a DNA virus, such as HBV, or a retrovirus, 
such as HIV. The cells may be in culture or otherwise isolated from a mammal (i.e., in 
vitro or ex vivo), or may be in vivo, e.g., in a mammal (e.g. such as a SCID mouse model 
as described by Mercer, et al (2001) Nature Medicine. 7(8): 927-933), in a primate, or in 
man. 

The invention also provides a method of enhancing T H 1 differentiation of ThO cells, 
comprising administering to a population comprising T H 0 cells a polypeptide or conjugate 
of the invention in an amount effective to increase the production of a cytokine associated 
with the T H l-phenotype (e.g., IFN-gamma) and/or decrease the production of a cytokine 
associated with the T H 2-phenotype (e.g., IL-4 or IL-5) in said population. The population 
may be in culture or otherwise isolated from a mammal (i.e., in vitro or ex vivo), or may 
be in vivo, e.g., in a mammal, in a primate, or in man. 

The invention also provides a method of inhibiting proliferation of a cell population, 
comprising contacting the cell population with a polypeptide, variant, or conjugate of the 
invention in an amount effective to decrease proliferation of the cell population. The cell 
population may be in culture or otherwise isolated from a mammal (i.e., in vitro or ex 
vivo), or may be in vivo, e.g., in a mammal, a primate, or man. 

These and other objects of the invention are discussed in more detail below. 

POLYPEPTIDES OF THE INVENTION 

The invention provides novel interferon-alpha polypeptides, collectively referred to 
herein as "polypeptides of the invention". The term "polypeptide(s) of the invention" is 
intended throughout to include variants of the polypeptide sequences disclosed herein. 
Also included in this invention are fusion proteins comprising polypeptides of the 
invention, and conjugates comprising polypeptides of the invention. 

Fragments of various interferon-alpha coding sequences were recursively 
recombined to form libraries comprising recombinant polynucleotides, from which 
polypeptides of the invention were derived. Methods for obtaining libraries of 
recombinant polynucleotides and/or for obtaining diversity in nucleic acids used as the 
substrates for recursive sequence recombination are also described infra. 

Exemplary polypeptides of the invention include polypeptides comprising 
sequences identified herein as SEQ ID NOs:2-35 and SEQ ID NOs: 37-44 (i.e., SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
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ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID 
NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO: 10, SEQ ID NO:21, SEQ ID NO:22, SEQ ID 
NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
5 NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, and SEQ ID 
NO:44). Some such polypeptides further comprise one or more additional amino acid(s), 
such as, for example, a methionine added to the N-terminus. The invention also provides 

10 fusion proteins and conjugates comprising these polypeptides, and isolated or 
recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO: 1 
and (b) comprises one or more substitution(s) relative to SEQ ID NO: 1 selected from the 

15 group consisting of F48A/L; V51P; F55A; F65A; F68P; LI 1 1A; and VI 14P. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO: 1 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 

20 8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide further comprises one or more 
substitution(s) relative to SEQ ID NO: 1 selected from the group consisting of H47Q; 
V51A; Q52P/E; A53T; F55S; L56V; F57L; Y58H; M61I; N113K; V114E; and E160D. 

25 In some instances, the polypeptide exhibits an interferon-alpha activity (e.g., antiviral 
activity, T H 1 differentiation activity, and/or antiproliferative activity). Some such 
polypeptides further comprise one or more additional amino acid(s), such as a 
methionine added to the N-terminus. The invention also provides fusion proteins and 
conjugates comprising these polypeptides, and isolated or recombinant nucleic acids 

30 encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO:l 
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and (b) comprises the substitution F48A/L relative to SEQ ID NO: 1. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO: 1 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 

5 1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. Some such polypeptides comprise the substitution F48A relative to 
SEQ ID NO: 1. In some instances, the polypeptide further comprises one or more 

10 substitution(s) relative to SEQ ID NO: 1 selected from the group consisting of H47Q; 
V51A/P; Q52P/E; A53T; F55A/S; L56V; F65A; F57L; Y58H; M61I; F68P; LI 1 1A; 
Nl 1 3K; VI 14E/P; and E160D. In some instances, the polypeptide exhibits an 
interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or 
antiproliferative activity). Some such polypeptides further comprise one or more 

15 additional amino acid(s), such as a methionine added to the N-terminus. The invention 
also provides fusion proteins and conjugates comprising these polypeptides, and isolated 
or recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO:l 

20 and (b) comprises the substitution F55A relative to SEQ ID NO: 1. Some such 

polypeptides comprise a sequence which differs from SEQ ID NO:l in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 

25 8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide further comprises one or more 
substitution(s) relative to SEQ ID NO:l selected from the group consisting of H47Q; 
F48A/L; V51A/P; Q52P/E; A53T; L56V; F65A; F57L; Y58H; M61I; F68P; LI 11 A; 

30 Nl 13K; VI 14E/P; and E160D. In some instances, the polypeptide exhibits an 

interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or 
antiproliferative activity). Some such polypeptides further comprise one or more 
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additional amino acid(s), such as a methionine added to the N-terminus. The invention 
also provides fusion proteins and conjugates comprising these polypeptides, and isolated 
or recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO:l 
and (b) comprises the substitution F65A relative to SEQ ID NO: 1. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO:l in 1,2, 3, 4, 5, 6, 7, 
8, 9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide further comprises one or more 
substitutions relative to SEQ ID NO: 1 selected from the group consisting of H47Q; 
F48A/L; V51 A/P; Q52P/E; A53T; F55A/S; L56V; F57L; Y58H; M61I; F68P; LI 1 1A; 
Nl 13K; VI 14E/P; and E160D. In some instances, the polypeptide exhibits an 
interferon-alpha activity (e.g., antiviral activity, ThI differentiation activity, and/or 
antiproliferative activity). Some such polypeptides further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus. The invention 
also provides fusion proteins and conjugates comprising these polypeptides, and isolated 
or recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID NO:l 
and (b) comprises the substitution LI 1 1A relative to SEQ ID-NO: 1. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO:l in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide further comprises one or more 
substitutions relative to SEQ ID NO: 1 selected from the group consisting of H47Q; 
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F48A/L; V51A/P; Q52P/E; A53T; F55A/S; L56V; F65A; F57L; Y58H; M61I; F68P; 
Nl 13K; VI 14E/P; and E160D. In some instances, the polypeptide exhibits an 
interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or 
antiproliferative activity). Some such polypeptides further comprise one or more 
5 additional amino acid(s), such as a methionine added to the N-terminus. The invention 
also provides fusion proteins and conjugates comprising these polypeptides, and isolated 
or recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 0 to 16 amino acid positions from SEQ LD 

10 NO: 13 and (b) comprises one or more of: Ala or Leu at position 48; Pro at position 51; 
Ala at position 55; Ala at position 65; Pro at position 68; Ala at position 111; and Pro at 
position 1 14 (position numbering relative to SEQ ID NO: 13). Some such polypeptides 
comprise a sequence which differs from SEQ ID NO: 13 in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 0-15 amino acid positions, in 0-14 

15 amino acid positions, in 0-13 amino acid positions, in 0-12 amino acid positions, in 0-1 1 
amino acid positions, in 0-10 amino acid positions, in 0-9 amino acid positions, in 0-8 
amino acid positions, in 0-7 amino acid positions, in 0-6 amino acid positions, in 0-5 
amino acid positions, in 0-4 amino acid positions, in 0-3 amino acid positions, in 0-2 
amino acid positions, or in 0-1 amino acid positions. Some such polypeptides comprise 

20 an Ala or a Leu at position 48. Some such polypeptides comprise an Ala at position 48. 
In some instances, the polypeptide further comprises one or more substitution(s) relative 
to SEQ ID NO:13 selected from the group consisting of H47Q; V51A; Q52P/E; A53T; 
F55S; L56V; F57L; Y58H; M61I; Nl 13K; VI 14E; and E160D. In some instances, the 
polypeptide exhibits an interferon-alpha activity (e.g., antiviral activity, ThI 

25 differentiation activity, and/or antiproliferative activity). Some such polypeptides further 
comprise one or more additional amino acid(s), such as a methionine added to the N- 
terminus. The invention also provides fusion proteins and conjugates comprising these 
polypeptides, and isolated or recombinant nucleic acids encoding these polypeptides. 
The invention also provides isolated or recombinant polypeptides which each 

30 comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 

NO: 36 and (b) comprises one or more substitution(s) relative to SEQ ID NO: 36 selected 
from the group consisting of M21A; I24P; F48A/L; T51P; S55A; F65A; F68P; F90A; 
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M93P; LI 11 A; V114P; F124A; I127P; and E160D. Some such polypeptides comprise a 
sequence which differs from SEQ ID NO:36 in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 1-14 amino acid 
positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1-11 amino acid 

5 positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 amino acid 
positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 amino acid 
positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 amino acid 
positions. In some instances, the polypeptide further comprises one or more 
substitution(s) relative to SEQ ID NO: 36 selected from the group consisting of P26L; 

10 H47Q; T51V; S55P/F; V56L; H58Y; L60M; F90Y; M93L; N95D; Nl 13K; VI 14E; 

R125Q; T132K; and L154F. In some instances, the polypeptide exhibits an interferon- 
alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or antiproliferative 
activity). Some such polypeptides further comprise one or more additional amino 
acid(s), such as a methionine added to the N-terminus. The invention also provides 

15 fusion proteins and conjugates comprising these polypeptides, and isolated or 
recombinant nucleic acids encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO:36 and (b) comprises the substitution F48A/L relative to SEQ ID NO:36. Some such 

20 polypeptides comprise a sequence which differs from SEQ ID NO:36 in 1,2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 

25 amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. Some such polypeptides comprise the substitution F48A relative to 
SEQ ID NO: 36. In some instances, the polypeptide further comprises one or more 
substitution(s) relative to SEQ ID NO:36 selected from the group consisting of M21A; 
I24P; P26L; H47Q; T51V/P; S55P/F/A; V56L; H58Y; L60M; F65A; F68P; F90Y/A; 

30 M93L/P; N95D; LI 11 A; N113K; V114E/P; F124A; R125Q; I127P; T132K; L154F; and 
E160D. In some instances, the polypeptide exhibits an interferon-alpha activity (e.g,, 
antiviral activity, ThI differentiation activity, and/or antiproliferative activity). Some 
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such polypeptides further comprise one or more additional amino acid(s), such as a 
methionine added to the N-terminus. The invention also provides fusion proteins and 
conjugates comprising these polypeptides, and isolated or recombinant nucleic acids 
encoding these polypeptides. 
5 The invention also provides isolated or recombinant polypeptides which each 

comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO:36 and (b) comprises the substitution F90A relative to SEQ ID NO: 36. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO:36 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 

10 in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1^6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide further comprises one or more 

15 substitution(s) relative to SEQ ID NO:36 selected from the group consisting of M21A; 
I24P; P26L; H47Q; F48A/L; T51V/P; S55P/F/A; V56L; H58Y; L60M; F65A; F68P; 
M93L/P; N95D; L111A; N113K; V114E/P; F124A; R125Q; I127P; T132K; L154F; and 
E160D. In some instances, the polypeptide exhibits an interferon-alpha activity (e.g., 
antiviral activity, T H 1 differentiation activity, and/or antiproliferative activity). Some 

20 such polypeptides further comprise one or more additional amino acid(s), such as a 
methionine added to the N-terminus. The invention also provides fusion proteins and 
conjugates comprising these polypeptides, and isolated or recombinant nucleic acids 
encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 

25 comprise a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 

NO:36 and (b) comprises the substitution E160D relative to SEQ ID NO:36. Some such 
polypeptides comprise a sequence which differs from SEQ ID NO: 36 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 

30 1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
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amino acid positions. In some instances, the polypeptide further comprises one or more 
substitution(s) relative to SEQ ID NO:36 selected from the group consisting of M21A; 
I24P; P26L; H47Q; F48A/L; T51V/P; S55P/F/A; V56L; H58Y; L60M; F65A; F68P; 
F90Y/A; M93L/P; N95D; L111A; N113K; V114E/P; F124A; R125Q; I127P; T132K; and 

5 L154F. In some instances, the polypeptide exhibits an interferon-alpha activity (e.g., 
antiviral activity, T H 1 differentiation activity, and/or antiproliferative activity). Some 
such polypeptides further comprise one or more additional amino acid(s), such as a 
methionine added to the N-terminus. The invention also provides fusion proteins and 
conjugates comprising these polypeptides, and isolated or recombinant nucleic acids 

10 encoding these polypeptides. 

The invention also provides isolated or recombinant polypeptides which each 
comprise a sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID 
NO: 38 and (b) comprises one or more of: Ala at position 21; Pro at position 24; Ala or 
Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; Pro at 

15 position 68; Ala at position 90; Pro at position 93; Ala at position 111; Pro at position 

1 14; Ala at position 124; Pro at position 127; and Glu at position 160 (position numbering 
relative to SEQ ID NO:38). Some such polypeptides comprise a sequence which differs 
from SEQ ID NO:38 in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid 
positions, e.g., in 0-15 amino acid positions, in 0-14 amino acid positions, in 0-13 amino 

20 acid positions, in 0-12 amino acid positions, in 0-1 1 amino acid positions, in 0-10 amino 
acid positions, in 0-9 amino acid positions, in 0-8 amino acid positions, in 0-7 amino acid 
positions, in 0-6 amino acid positions, in 0-5 amino acid positions, in 0-4 amino acid 
positions, in 0-3 amino acid positions, in 0-2 amino acid positions, or in 0-1 amino acid 
positions. Some such polypeptides comprise an Ala or a Leu at position 48. Some such 

25 polypeptides comprise an Ala at position 48. In some instances, the polypeptide further 
comprises one or more substitution(s) relative to SEQ ID NO:38 selected from the group 
consisting of P26L; H47Q; V51T; F55P/S; L56V; Y58H; L60M; F90Y; M93L; N95D; 
N113K; V114E; R125Q; T132K; and F154L. In some instances, the polypeptide 
exhibits an interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, 

30 and/or antiproliferative activity). Some such polypeptides further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus. The invention 
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also provides fusion proteins and conjugates comprising these polypeptides, and isolated 

or recombinant nucleic acids encoding these polypeptides. 

Other modifications contemplated for polypeptides of the invention include those 

described below and in the section entitled "INTERFERON- ALPHA CONJUGATES". 
5 It is to be understood that while the examples and modifications to the parent 

polypeptide are generally provided herein relative to the sequence SEQ ID NO: 1 (or 

relative to SEQ ID NO:36, or relative to some other specified sequence), the disclosed 

modifications may also be made in equivalent amino acid positions of any of the other 

polypeptides of the invention (such as a polypeptide comprising a sequence selected from 
10 SEQ ID NOs:2-35 and SEQ ID NOs:37-44) described herein. Thus, as an example, the 

substitution F48L relative to SEQ ID NO:l is understood to correspond to the substitution 

A48L in SEQ ID NO: 1 3, and so on. 

The following tables provide sequences of some interferon-alpha polypeptides of the 

invention. For clarity, the sequences are shown relative to reference sequences SEQ ID 
15 NO:l (Table 1) or SEQ ID NO:36 (Table 2). Some such polypeptides exhibit an 

interferon-alpha activity, such as antiviral activity, ThI differentiation activity, and/or 

antiproliferative activity. 



Table 1 



Polypeptide Sequence (relative to SEQ ID NO:1) 


Clone name 


SEQ ID 


SEQ ID NO:1 


B9x14EC4 


SEQ ID NO:1 


SEQ ID NO:1 F55P 


14epi07 


SEQ ID NO:2 


SEQIDNO:1 

V51 A Q52P F55S L56V F57L Y58H 


14epi08 


SEQ ID NO:3 


SEQ ID NO:1 V51 A Q52E A53T F55P L56V F57L Y58H 


14epi09 


SEQ ID NO:4 


SEQ ID NO:1 V51 A Q52E F55S L56V F57L Y58H 


14epi10 


SEQ ID NO:5 


SEQ ID NO:1 

V51A F55S L56A Y58H 


14epi11 


SEQ ID NO:6 


SEQ ID NO:1 F48L E160D 


14epi12d 


SEQ ID NO:7 


SEQ ID NO:1 
H47Q 


14epi13 


SEQ ID NO:8 


SEQ ID NO:1 M61I 


14epi14 


SEQ ID NO:9 


SEQIDNO:1 F48L 


14epi15 


SEQ ID NO:10 
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Polypeptide Sequence (relative to SEQ ID NO:1) 


Clone name 


SEQ ID 


SEQIDNO:1 F48L F55A 


14epi16 


SEQIDNO:11 


SEQ ID NO:1 F48L F55S 


14epi17 


SEQ ID NO:12 


SEQ ID NO:1 F48A 


14epi18 


SEQ ID NO:13 


SEQ ID NO:1 F48A F55A 


14epi19 


SEQ ID NO:14 


SEQ ID NO:1 F48AF55S 


14epi20 


SEQ ID NO:15 


SEQ ID NO:1 F55A 


14epi21 


SEQ ID NO:16 


SEQ ID NO:1 F55S 


1 4epi22 . 


SEQ ID NO:17 


SEQ ID NO:1 F48A F55S L111A 


14epi23 


SEQ ID NO:18 


SEQIDNO:1 F48A F55S N113KV114E 


14epi24 


SEQ ID NO:19 


SEQ ID NO:1 F48L F55S L1 11 A 


1 4epi25 


SEQ ID NO:20 


SEQ ID NO:1 F48L F55S N113K V114E 


1 4epi26 


SEQ ID NO:21 


SEQ ID NO:1 F48A F55S F65A L1 11 A 


14epi27 


SEQ ID NO:22 


SEQ ID NO:1 F48A F55S F65A N1 1 3K V1 1 4E 


14epi28 


SEQ ID NO:23 


SEQ ID NO:1 F48A F55S F65A L11 1 A 


14epi29 


SEQ ID NO:24 


SEQ ID NO:1 F48A F55A L1 1 1 A 


14epi30 


SEQ ID NO:25 


SEQ ID NO:1 F48A F55A N1 1 3K V1 1 4E 


14epi31 


SEQ ID NO:26 


SEQ ID NO:1 F48L Q52P F55S N1 1 3K V1 1 4E 


14epi32 


SEQ ID NO:27 


SEQ ID NO:1 Q52P F55S N1 1 3K V1 1 4E 




SEQ ID NO:28 


SEQ ID NO:1 Q52P F55S N1 1 3K V1 1 4E 


14epi34 


SEQ ID NO:29 


SEQ ID NO:1 F48L V51 A Q52P F55S L56V F57L Y58H 

M1 1 IK \/1 1 /IP 


14epi35 


SEQ ID NO:30 


SEQ ID NO:1 V51 A Q52E A53T F55P L56V F57L Y58H 
N113K V114E 


14epi36 


ccn in 

O 1 \—l 1 V\_/ . O 1 


SEQ ID NO:1 V51A F55S L56V F57L Y58H N113K 
V114E 


1 4epi37 


SEQ ID NO:32 


SEQ ID NO:1 V51 A Q52P F55S L56V F57L Y58H 
N113K V114E 


14epi38 


SEQ ID NO:33 


SEQ ID NO:1 H47Q F48A F55S 


14epi39 


SEQ ID NO:34 


SEQ ID NO:1 H47Q F48A F55S L56V Y58H 


1 4epi42 


SEQ ID NO:35 
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Table 2 



PnlvnpntiHp ^pnnpnrp frplativp to SFO ID NO'^fi^ 


f^lnnp namp 

V/lul 1 lUl 1 


SEQ ID 


SEQ ID NO:36 


25epi0o 


o i p\ k i /^~\ . 
ofcVj IU iNvJ.OD 


SEQ ID NO:36 F48A L154F 


25epi18 


SEQ ID NO:37 


SEQ ID NO:36 F48A T51 V S55F V56L H58Y L1 54F 


25epi19 


SEQ ID NO:38 


SEQ ID NO:36 F48A T51V V56L H58Y L154F 


25epi20 


SEQ ID NO:39 


SEQ ID NO:36 F48A T51V L154F 


25epi25 


SEQ ID NO:40 


SEQ ID NO:36 F48A T51V F90Y M93L N95D L154F 


25epi26 


SEQ ID NO:41 


SEQ ID NO:36 F48A T51V F90A R125Q L154F 


25epi27 


SEQ ID NO:42 


SEQ ID NO:36 F48A T51 V F90Y M93L N95D R125Q 
L154F 


25epi28 


SEQ ID NO:43 


SEQ ID NO:36 L1 54F E1 60D 


25epi29 


SEQ ID NO:44 



5 Sequence Variations 

As noted above, polypeptides of the present invention include polypeptides 
comprising sequences which differ from SEQ ID NO:l in 1 to 16 amino acid positions 
and comprise one or more substitutions relative to SEQ ID NO: 1 selected from the group 
consisting of F48A/L; V51P; F55A; F65A; F68P; L111A; and V114P. Polypeptides of 

10 the present invention also include polypeptides comprising sequences which differ from 
SEQ ID NO:36 in 1 to 16 amino acid positions and comprise one or more substitutions 
relative to SEQ ID NO: 36 selected from the group consisting of M21A; I24P; F48A/L; 
T51P; S55A; F65A; F68P; F90A; M93P; LI 11 A; V114P; F124A; I127P; andE160D. 
Some such polypeptides exhibit an interferon-alpha activity. 

15 For example, some polypeptides of the invention comprise a sequence having a 

length of about 151 amino acids, such as about 151, 152, 153, 154, 155 ,156, 157, 158, 
159, 160, 161, 162, 163, 164, or 165 amino acids, corresponding to a deletion of between 
1 and 15 amino acids relative to a parent polypeptide sequence (such as, for example, one 
of SEQ ID NOs:2-35 or 37-44). In some instances, between 1 and 11, e.g., between 1 and 
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10, such as between 1 and 7, e.g. between 1 and 5, such as between 1 and 3 amino acids 
are deleted from the C-terminus, i.e. the polypeptide is C-terminally truncated compared 
to the parent polypeptide sequence (such as, for example, one of SEQ ID NOs: 2-35 or 
37-44) by 1-11 amino acid residues (e.g. by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or 1 1 amino acid 

5 residues), such as by 1-10, 1-7, e.g., by 1-5 or by 1-3 amino acid residues. Alternatively, 
or in addition, some such polypeptides are N-terminally truncated compared to the parent 
polypeptide sequence (such as, one of SEQ ID NOs: 2-35 or 37-44) by 1-4 amino acid 
residues (e.g. by 1, 2, 3, or 4 amino acid residues), e.g., 1-4, 1-3, 1-2 or 1 amino acid 
residue(s) are removed from the N-terminus. Some such polypeptides further comprise a 

10 methionine at the N-terminus. Some such polypeptides exhibit an interferon-alpha 
activity. 

As another example, some polypeptides of the invention comprise a sequence 
containing between 1 and 16 amino acid substitutions relative to SEQ ID NO:l (e.g. 1, 2, 
3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid substitutions), such as 1-14 or 1- 

15 12 or 1-10 or 1-8 or 1-6 or 1-5 or 1-4 or 1-3 or 1-2 amino acid substitutions, in which at 
least one of the substitutions is selected from the group consisting of F48A/L; V51P; 
F55A; F65A; F68P; LI 1 1 A; and VI 14P. Other polypeptides of the invention comprise a 
sequence containing between 1 and 16 amino acid substitutions relative to SEQ ED NO: 36 
(e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid substitutions), such as 

20 1-14 or 1-12 or 1-10 or 1-8 or 1-6 or 1-5 or 1-4 or 1-3 or 1-2 amino acid substitutions, in 
which at least one of the substitutions is selected from the group consisting of M21A; 
I24P; F48A/L; T51P; S55A; F65A; F68P; F90A; M93P; LI 11 A; V114P; F124A; I127P; 
and E160D. One or more additional amino acid substitutions may be made in the 
polypeptide sequence according to, for example, a substitution group (such as, a 

25 conservative substitution group), such as one set forth below. Alternatively, or in 

addition, one or more additional amino acid substitutions may made in the polypeptide 
sequence which introduces or removes an amino acid residue comprising an attachment 
group for a non-polypeptide moiety. Examples include introduction of one or more N- 
glycosylation site(s), introduction of one or more cysteine residue(s) or lysine residue(s), 

30 removal of one or more N-glycosylation site(s), and/or or removal of one or more 

lysine(s) or histidine(s). Some such polypeptides exhibit an interferon-alpha activity. 
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As a non-limiting example, a polypeptide of the invention may have a sequence 
which differs from SEQ ID NO: 1 in a total of up to 16 positions (which may be a 
combination of amino acid substitutions, deletions, and/or insertions, including those 
described above). In some instances, none, some, or all of the substitutions are 
5 substitutions according to a substitution group defined below. 

Amino acid substitutions in accordance with the invention may include, but are not 
limited to, one or more conservative amino acid substitutions. Conservative substitution 
tables providing functionally similar amino acids are well known in the art. One example 
is provided in the table below (Table 3), which sets forth six exemplary groups that 
10 contain amino acids which may be considered "conservative substitutions" for one 
another. 



Table 3 

Conservative Substitution Groups 



1 


Alanine (A) 


Glycine (G) 


Serine (S) 


Threonine (T) 


2 


Aspartic acid (D) 


Glutamic acid (E) 






3 


Asparagine (N) 


Glutamine (Q) 






4 


Arginine (R) 


Lysine (K) 


Histidine (H) 




5 


Isoleucine (I) 


Leucine (L) 


Methionine (M) 


Valine (V) 


6 


Phenylalanine (F) 


Tyrosine (Y) 


Tryptophan (W) 





15 Other substitution groups of amino acids can be envisioned. For example, amino 

acids can be grouped by similar function or chemical structure or composition (e.g., 
acidic, basic, aliphatic, aromatic, sulfur-containing). For example, an Aliphatic grouping 
may comprise: Glycine (G), Alanine (A), Valine (V), Leucine (L), Isoleucine (I). Other 
groups containing amino acids that are considered conservative substitutions for one 

20 another include: Aromatic: Phenylalanine (F), Tyrosine (Y), Tryptophan (W); Sulfur- 
containing: Methionine (M), Cysteine (C); Basic: Arginine (R), Lysine (K), Histidine 
(H); Acidic: Aspartic acid (D), Glutamic acid (E), Asparagine (N), Glutamine (Q). See 
also Creighton (1984) Proteins, W.H. Freeman and Company, for additional groupings of 
amino acids. Listing of a polypeptide sequence herein, in conjunction with the above 

25 substitution groups, provides an express listing of all conservatively substituted 
polypeptide sequences. 
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Percent Sequence Identity 

In one aspect, the invention provides isolated or recombinant polypeptides each 
comprising a sequence having at least 90% sequence identity (e.g., at least about 91%, at 
5 least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 
96%, at least about 97%, at least about 98%, or at least about 99% amino acid sequence 
identity) to any one of SEQ ID NOs:l-35, wherein the sequence comprises one or more 
of: Ala or Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; 
Pro at position 68; Ala at position 111; and Pro at position 1 14, amino acid position 
10 relative to SEQ ID NO: 1. In some instances the polypeptide exhibits an interferon-alpha 
activity. 

The invention also provides isolated or recombinant polypeptides each comprising 
a sequence having at least 90% sequence identity (e.g., at least about 91%, at least about 
92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at 

15 least about 97%, at least about 98%, or at least about 99% amino acid sequence identity) 
to any one of SEQ ID NOs: 36-44, wherein the sequence comprises one or more of: Ala at 
position 21; Pro at position 24; Ala or Leu at position 48; Pro at position 51; Ala at 
position 55; Ala at position 65; Pro at position 68; Ala at position 90; Pro at position 93; 
Ala at position 111; Pro at position 1 14; Ala at position 124; Pro at position 127; and Asp 

20 at position 160, amino acid position relative to SEQ ID NO:36. In some instances the 
polypeptide exhibits an interferon-alpha activity. 

The degree to which a sequence (polypeptide or nucleic acid) is similar to another 
provides an indication of similar structural and functional properties for the two 
sequences. Accordingly, in the context of the present invention, sequences which have a 

25 similar sequence to any given exemplar sequence are a feature of the present invention. 
In particular, sequences that have percent sequence identities as defined below are a 
feature of the invention. 

A variety of methods of determining sequence relationships can be used, including 
manual alignment and computer assisted sequence alignment and analysis. A variety of 

30 computer programs for performing sequence alignments are available, or an alignment 
can be prepared manually by one of skill, as described below. 
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As noted above, the sequences of the polypeptides and nucleic acids employed in the 
subject invention need not be identical, but can be substantially identical to the 
corresponding sequence of a polypeptide of the invention or nucleic acid of the invention. 
For example, polypeptides of the invention can be subject to various changes, such as one 

5 or more amino acid insertions, deletions, and/or substitutions, either conservative or non- 
conservative, including where, e.g., such changes might provide for certain advantages in 
their use, such as, in their therapeutic or prophylactic use or administration or diagnostic 
application. The nucleic acids of the invention can also be subject to various changes, 
such as one or more substitutions of one or more nucleic acids in one or more codons 

10 such that a particular codon encodes the same or a different amino acid, resulting in either 
a silent variation (as defined herein) or non-silent variation, or one or more deletions of 
one or more nucleic acids (or codons) in the sequence. The nucleic acids can also be 
modified to include one or more codons that provide for optimum expression in an 
expression system (e.g., bacterial or mammalian), while, if desired, said one or more 

15 codons still encode the same amino acid(s). Such nucleic acid changes might provide for 
certain advantages in their therapeutic or prophylactic use or administration, or diagnostic 
application. The nucleic acids and polypeptides can be modified in a number of ways so 
long as they comprise a sequence substantially identical (as defined below) to a sequence 
in a respective nucleic acid or polypeptide of the invention. 

20 The term "identical" or "identity," in the context of two or more nucleic acid or 

polypeptide sequences, refers to two or more sequences that are the same or have a 
specified percentage of amino acid residues or nucleotides that are the same, when 
compared and aligned for maximum similarity, as determined using the sequence 
comparison algorithm described below or by visual inspection. 

25 The "percent sequence identity" ("% identity") of a subject sequence to a reference 

(i.e. query) sequence means that the subject sequence is identical (i.e., on an amino acid- 
by-amino acid basis for a polypeptide sequence, or a nucleotide-by-nucleotide basis for a 
polynucleotide sequence) by a specified percentage to the query sequence over a 
comparison length. 

30 The percent sequence identity of a subject sequence to a query sequence is calculated 

as follows. First , the optimal alignment of the two sequences is determined using a 
sequence comparison algorithm with specific alignment parameters. This determination 
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of the optimal alignment may be performed using a computer, or may be manually 
calculated, as described below. Then, the two optimally aligned sequences are compared 
over the comparison length, and the number of positions in the optimal alignment at 
which identical residues occur in both sequences are determined, which provides the 
5 number of matched positions. The number of matched positions is then divided by the 
total number of positions of the comparison length (which, unless otherwise specified, is 
the length of the query sequence), and then the result is multiplied by 100, to yield the 
percent sequence identity of the subject sequence to the query sequence. 

With regard to polypeptide sequences, typically one sequence is regarded as a "query 

10 sequence" (for example, a polypeptide sequence of the invention) to which one or more 
other sequences, i.e., "subject sequence(s)" (for example, sequences present in a sequence 
database) are compared. The sequence comparison algorithm uses the designated 
alignment parameters to determine the optimal alignment between the query sequence 
and the subject sequence(s). When comparing a query sequence against a sequence 

15 database, such as, e.g., GENBANK® (Genetic Sequence Data Bank; U.S. Department of 
Health and Human Services) or GENESEQ® (Thomson Derwent; also available as 
DGENE® on STN), usually only the query sequence and the alignment parameters are 
input into the computer; optimal alignments between the input query sequence and each 
subject sequence present in the database are returned, generally for up to a desired 

20 number of subject sequences. 

Two polypeptide sequences are "optimally aligned" when they are aligned using 
defined parameters, i.e., a defined amino acid substitution matrix, gap existence penalty 
(also termed gap open penalty), and gap extension penalty, so as to arrive at the highest 
similarity score possible for that pair of sequences. The BLOSUM62 matrix (Henikoff 

25 and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89(22): 10915-10919) is often used as a 
default scoring substitution matrix in polypeptide sequence alignment algorithms (such as 
BLASTP, described below). The gap existence penalty is imposed for the introduction of 
a single amino acid gap in one of the aligned sequences, and the gap extension penalty is 
imposed for each residue position in the gap. Unless otherwise stated, alignment 

30 parameters employed herein are: BLOSUM62 scoring matrix, gap existence penalty =11, 
and gap extension penalty = 1. The alignment score is defined by the amino acid 
positions of each sequence at which the alignment begins and ends (e.g. the alignment 
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window), and optionally by the insertion of a gap or multiple gaps into one or both 
sequences, so as to arrive at the highest possible similarity score. 

While optimal alignment between two or more sequences can be determined 
manually (as described below), the process is facilitated by the use of a computer- 

5 implemented alignment algorithm such as BLAST® (National Library of Medicine), e.g., 
BLASTP for polypeptide sequences and BLASTN for nucleic acid sequences, described 
in Altschul et al (1997) Nucleic Acids Res. 25:3389-3402, and made available to the 
public through various sources, such as the National Center for Biotechnology 
Information (NCBI) Website. When using a computerized BLAST interface, if the option 

10 exists to use a "low complexity filter", this option should be turned off (i.e., no filter). 

The optimal alignment between two polypeptide sequences can also be determined 
by a manual calculation of the BLASTP algorithm (i.e., without aid of a computer) using 
the same alignment parameters specified above (matrix = BLOSUM62, gap open penalty 
= 11, and gap extension penalty = 1). To begin, the two sequences are initially aligned by 

15 visual inspection. An initial alignment score is then calculated as follows: for each 

individual position of the alignment (i.e., for each pair of aligned residues), a numerical 
value is assigned according to the BLOSUM62 matrix (Fig. 4). The sum of the values 
assigned to each pair of residues in the alignment is the initial alignment score. If the two 
sequences being aligned are highly similar, often this initial alignment provides the 

20 highest possible alignment score. The alignment with the highest possible alignment 
score is the optimal alignment based on the alignment parameters employed. Fig. 5A 
shows an example of the calculation of an alignment score for two sequences, a "query" 
sequence, identified herein as residues 29-50 of SEQ ID NO:l (upper), and a "subject" 
sequence, identified herein as residues 30-52 of SEQ ID NO: 8 (lower). The sequences 

25 were aligned by visual inspection, and the numerical value assigned by the BLOSUM62 
matrix for each aligned pair of amino acids is shown beneath each position in the 
alignment (to aid in visualization, each identical pair of amino acids in the alignment is 
shown in boldface). In this example, this initial alignment provided the highest possible 
alignment score (the sum of the values shown beneath each aligned position); any other 

30 alignment of these two sequences, with or without gaps, would result in a lower 
alignment score. 
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In some instances, a higher alignment score might be obtained by introducing one or 
more gaps into the alignment. Whenever a gap is introduced into an alignment, a gap 
open penalty is assigned, and in addition a gap extension penalty is assessed for each 
residue position within that gap. Therefore, using the alignment parameters described 
above (including gap open penalty =11 and gap extension penalty = 1), a gap of one 
residue in the alignment would correspond to a value of -(ll+(lxl)) = -12 assigned to 
the gap; a gap of three residues would correspond to a value of - (1 1+(3 x 1)) = -14 
assigned to the gap, and so on. This calculation is repeated for each new gap introduced 
into the alignment. Figs. 5B and 5C show an example which demonstrates how 
introduction of a gap into an alignment can result in a higher alignment score, despite the 
gap penalty. Fig. 5B shows an initial alignment of residues 29-50 of SEQ ID NO: 1 
(upper, query) and residues 30-50 of SEQ ID NO:46 (lower, subject) made by visual 
inspection, which results in an initial alignment score of 67. Fig. 5C shows the effect of 
the introduction of a one-residue gap in SEQ ID NO:46 on the alignment score; despite 
the gap penalty of —12, the overall alignment score of the two sequences increases to 88. 
In this example, the alignment shown in Fig. 5C provides the highest possible alignment 
score, and is thus the optimal alignment of these two sequences; any other alignment of 
these two sequences (with or without gaps) would result in a lower alignment score. 

It is to be understood that the examples of sequence alignment calculations described 
above, which use relatively short sequences, are provided for illustrative purposes only; in 
practice, the alignment parameters employed (BLOSUM62 matrix, gap open penalty = 
1 1, and gap extension penalty = 1) are generally intended for polypeptide sequences 85 
amino acids in length or longer. The NCBI website provides the following alignment 
parameters for sequences of other lengths (which are suitable for computer-aided as well 
as manual alignment calculation, using the same procedure as described above). For 
sequences of 50-85 amino acids in length, optimal parameters are BLOSUM80 matrix 
(Henikoff and Henikoff, supra), gap open penalty = 10, and gap extension penalty = 1. 
For sequences of 35-50 amino acids in length, optimal parameters are PAM70 matrix 
(Dayhoff, M.O., Schwartz, R.M. & Orcutt, B.C. (1978) "A model of evolutionary change 
in proteins." In Atlas of Protein Sequence and Structure, vol. 5, suppl. 3, M.O. Dayhoff 
(ed.), pp. 345-352, Natl. Biomed. Res. Found.,Washington, DC), gap open penalty = 10, 
and gap extension penalty = 1. For sequences of less than 35 amino acids in length, 
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optimal parameters are PAM30 matrix (Dayhoff, M.O., supra), gap open penalty = 9, and 
gap extension penalty = 1 . 

Once the sequences are optimally aligned, the percent identity of the subject 
sequence relative to the query sequence is calculated by counting the number of positions 
in the optimal alignment which contain identical residue pairs, divide that by the number 
of residues in the comparison length (which, unless otherwise specified, is the number of 
residues in the query sequence), and multiplying the resulting number by 100. Referring 
back to the examples shown in Fig. 5, in each example the sequence designated as the 
query sequence is 22 amino acids in length. In the alignment of Fig. 5A, 20 pairs of 
aligned amino acid residues (shown in boldface) are identical in the optimal alignment of 
the query sequence (upper) with the subject sequence (lower). Thus, this particular 
subject sequence has (20/22) x 100 = 91.1% identity to the query sequence; in other 
words, the subject sequence in the alignment of Fig. 5 A has at least 91% amino acid 
sequence identity to the query sequence. In the alignment shown in Fig. 5C, 18 pairs of 
amino acid residues (shown in boldface) in the optimal alignment are identical; thus this 
particular subject sequence has (18/22) x 100 = 81.8% identity to the query sequence; in 
other words, the subject sequence in the alignment of Fig. 5C has at least 81% amino acid 
sequence identity to the query sequence. 

As applied to polypeptides, the term "substantial identity" (or "substantially 
identical") typically means that when two amino acid sequences (i.e. a query sequence 
and a subject sequence) are optimally aligned using the BLASTP algorithm (manually or 
via computer) using appropriate parameters described above, the subject sequence has at 
least about 60% , 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99%, or more percent amino acid sequence identity to the query sequence. In some 
instances, the substantial identity exists over a comparison length of at least about 100 
amino acid residues, such as, at least about 110, 120, 125, 130, 135, 140, 145, 150, 155, 
156, 157, 158, 159, 160, 161, 162, 163, 164, 165, or 166 amino acid residues. 

Similarly, as applied in the context of two nucleic acid sequences, the term 
substantial identity (or substantially identical) means that when two nucleic acid 
sequences (i.e. a query and a subject sequence) are optimally aligned using the BLASTN 
algorithm (manually or via computer) using appropriate parameters described below, the 
subject sequence has at least about 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 
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94%, 95%, 96%, 97%, 98%, 99%, or more percent nucleic acid sequence identity to the 
query sequence. Parameters used for nucleic acid sequence alignments are: match 
reward 1, mismatch penalty -3, gap existence penalty 5, gap extension penalty 2 
(substitution matrices are not used in the BLASTN algorithm). In some instances, the 
substantial identity exists over a comparison length of at least about 300 nucleotide 
residues, such as at least about 330, 360, 375, 390, 405, 420, 435, 450, 465, 480, 483, 
486, 489, 492, 495, or 498 nucleotides. 

Additional Aspects 

Any polypeptide of the invention may be present as part of a larger polypeptide 
sequence, e.g. a fusion protein, such as occurs upon the addition of one or more domains 
or subsequences for stabilization or detection or purification of the polypeptide. A 
polypeptide purification subsequence may include, e.g., an epitope tag, a FLAG tag, a 
polyhistidine sequence, a GST fusion, or any other detection/purification subsequence or 
"tag" known in the art. These additional domains or subsequences either have little or no 
effect on the activity of the polypeptide of the invention, or can be removed by post 
synthesis processing steps such as by treatment with a protease, inclusion of an intein, or 
the like. 

The invention includes fusion proteins comprising a polypeptide of the invention, 
e.g., as described herein, fused to an Ig molecule, e.g., a human IgG Fc ("fragment 
crystallizable," or fragment complement binding) hinge, CH2 domain and CH3 domain, 
and nucleotide sequences encoding such fusion protein. Fc is the portion of the antibody 
responsible for binding to antibody receptors on cells and the Clq component of 
complement. These fusion proteins and their encoding nucleic acids are useful as 
prophylactic and/or therapeutic drugs or as diagnostic tools (see also, e.g., Challita-Eid, P. 
et al. (1998) J. Immunol 160:3419-3426; Sturmhoefel, K. et al. (1999) Cancer Res 
59:4964-4972). The invention also includes fusion proteins comprising a polypeptide of 
the invention, fused to an albumin molecule, such as human serum albumin (HSA), as 
described, for example, in U.S. Patent No. 5,876,969, and nucleotide sequences encoding 
the fusion protein. The Ig and albumin fusion proteins may exhibit increased polypeptide 
serum half-life and/or functional in vivo half-life, reduced polypeptide antigenicity, 
increased polypeptide storage stability, or increasing bioavailability, e.g. increased 
AUC SC , and are thus may be useful as prophylactic and/or therapeutic drugs. 
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Any polypeptide of the invention may also comprise one or more modified amino 
acid. The modified amino acid may be, e.g., a glycosylated amino acid, a PEGylated 
amino acid, a farnesylated amino acid, an acetylated amino acid, a biotinylated amino 
acid, an amino acid conjugated to a lipid moiety, or an amino acid conjugated to an 

5 organic derivatizing agent. The presence of modified amino acids may be advantageous 
in, for example, (a) increasing polypeptide serum half-life and/or functional in vivo half- 
life, (b) reducing polypeptide antigenicity, (c) increasing polypeptide storage stability, or 
(d) increasing bioavailability, e.g. increasing the AUC SC . Amino acid(s) are modified, for 
example, co-translationally or post-translationally during recombinant production (e.g., 

10 N-linked glycosylation at N-X-S/T motifs during expression in mammalian cells) or 

modified by synthetic means. This aspect is described in more detail in the section herein 
entitled "INTERFERON-ALPHA CONJUGATES". 

The invention also provides a composition comprising at least one polypeptide of the 
invention, and an excipient or carrier. The composition may be a composition comprising 

1 5 a pharmaceutical^ acceptable excipient or carrier. Exemplary compositions and 
excipients and carriers are described below. 

Making Polypeptides 

Recombinant methods for producing and isolating polypeptides of the invention are 
described herein. In addition to recombinant production, the polypeptides may be 

20 produced by direct peptide synthesis using solid-phase techniques (see, e.g., Stewart et ah 
(1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San Francisco; Merrifield J. 
(1963) J Am Chem Soc 85:2149-2154). Peptide synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be achieved, for 
example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer, Foster 

25 City, Calif.) in accordance with the instructions provided by the manufacturer. For 

example, subsequences may be chemically synthesized separately and combined using 
chemical methods to provide full-length polypeptides or fragments thereof. Alternatively, 
such sequences may be ordered from any number of companies which specialize in 
production of polypeptides. Most commonly, polypeptides of the invention may be 

30 produced by expressing coding nucleic acids and recovering polypeptides, e.g., as 
described below. 
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Methods for producing the polypeptides of the invention are also included. One 
such method comprises introducing into a population of cells any nucleic acid of the 
invention described herein, which is operatively linked to a regulatory sequence effective 
to produce the encoded polypeptide, culturing the cells in a culture medium to express 
5 the polypeptide, and isolating the polypeptide from the cells or from the culture medium. 
An amount of nucleic acid sufficient to facilitate uptake by the cells (transfection) and/or 
expression of the polypeptide is utilized. The nucleic acid is introduced into such cells by 
any delivery method described herein, including, e.g., injection, gene gun, passive uptake, 
etc. The nucleic acid may be part of a vector, such as a recombinant expression vector, 

10 including a DNA plasmid vector, or any vector described herein. The nucleic acid or 
vector comprising a nucleic acid of the invention may be prepared and formulated as 
described herein, above. Such a nucleic acid or expression vector may be introduced into 
a population of cells of a mammal in vivo, or selected cells of the mammal (e.g., tumor 
cells) may be removed from the mammal and the nucleic acid expression vector 

15 introduced ex vivo into the population of such cells in an amount sufficient such that 
uptake and expression of the encoded polypeptide results. Or, a nucleic acid or vector 
comprising a nucleic acid of the invention is produced using cultured cells in vitro. In 
one aspect, the method of producing a polypeptide of the invention comprises introducing 
into a population of cells a recombinant expression vector comprising any nucleic acid of 

20 the invention described herein in an amount and formula such that uptake of the vector 
and expression of the encoded polypeptide will result; administering the expression 
vector into a mammal by any introduction/delivery format described herein; and isolating 
the polypeptide from the mammal or from a byproduct of the mammal. 

Antibodies 

25 In another aspect of the invention, a polypeptide of the invention (or an antigenic 

fragment thereof) is used to produce antibodies which have, e.g., diagnostic, therapeutic, 
or prophylactic uses, e.g., related to the activity, distribution, and expression of 
polypeptides and fragments thereof. Antibodies to polypeptides of the invention may be 
generated by methods well known in the art. Such antibodies may include, but are not 

30 limited to, polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments and 
fragments produced by a Fab expression library. Antibodies, e.g., those that block 
receptor binding, are especially preferred for therapeutic and/or prophylactic use. 
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Polypeptides for antibody induction do not require biological activity; however, the 
polypeptides or peptides should be antigenic. Peptides used to induce specific antibodies 
may have an amino acid sequence consisting of at least about 10 amino acids, preferably 
at least about 15 or 20 amino acids or at least about 25 or 30 amino acids. Short stretches 
of a polypeptide may be fused with another protein, such as keyhole limpet hemocyanin, 
and antibody produced against the chimeric molecule. 

Methods of producing polyclonal and monoclonal antibodies are known to those of 
skill in the art, and many antibodies are available. See, e.g., Current Protocols in 
Immunology, John Colligan et al, eds., Vols. I-IV (John Wiley & Sons, Inc., NY, 1991 
and 2001 Supplement); and Harlow and Lane (1989) Antibodies: A Laboratory Manual 
Cold Spring Harbor Press, NY; Stites et al. (eds.) Basic and Clinical Immunology (4th 
ed.) Lange Medical Publications, Los Altos, CA, and references cited therein; and Goding 
(1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New 
York, NY; and Kohler and Milstein (1975) Nature 256:495-497. Other suitable 
techniques for antibody preparation include selection of libraries of recombinant 
antibodies in phage or similar vectors. See, Huse et al. (1989) Science 246:1275-1281; 
and Ward et al. (1989) Nature 341:544-546. Specific monoclonal and polyclonal 
antibodies and antisera will usually bind with a K D of at least about 0.1 [xM, preferably at 
least about 0.01 [xM or better, and most typically and preferably, 0.001 (iMor better. 

Detailed methods for preparation of chimeric (humanized) antibodies can be found 
in U.S. Patent 5,482,856. Additional details on humanization and other antibody 
production and engineering techniques can be found in Borrebaeck (ed.) (1995) Antibody 
Engineering, 2 nd Edition Freeman and Company, NY (Borrebaeck); McCafferty et al. 
(1996) Antibody Engineering, A Practical Approach IRL at Oxford Press, Oxford, 
England (McCafferty), and Paul (1995) Antibody Engineering Protocols Humana Press, 
Towata, NJ (Paul). 

In one aspect, this invention provides for fully humanized antibodies against the 
polypeptides of the invention or fragments thereof. Humanized antibodies are especially 
desirable in applications where the antibodies are used as therapeutics and/or 
prophylactics in vivo in human patients. Human antibodies consist of characteristically 
human immunoglobulin sequences. The human antibodies of this invention can be 
produced in using a wide variety of methods (see, e.g., Larrick et al., U.S. Pat. No. 
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5,001,065, and Borrebaeck McCafferty and Paul, supra, for a review). In one aspect, the 
human antibodies of the present invention are produced initially in trioma cells. Genes 
encoding the antibodies are then cloned and expressed in other cells, such as nonhuman 
mammalian cells. The general approach for producing human antibodies by trioma 
5 technology is described by Ostberg et al. (1983), Hybridoma 2:361-367, Ostberg, U.S. 
Pat. No. 4,634,664, and Engelman et al., U.S. Pat. No. 4,634,666. The antibody- 
producing cell lines obtained by this method are called triomas because they are 
descended from three cells - two human and one mouse. Triomas have been found to 
produce antibody more stably than ordinary hybridomas made from human cells. 
10 Other uses contemplated for polypeptides of the invention are provided throughout 

the specification. 

INTERFERON-ALPHA CONJUGATES 

In another aspect, the invention relates to a conjugate comprising a polypeptide 
exhibiting an interferon-alpha activity which comprises an amino acid sequence of any 

15 one of SEQ ID NOs:2-35 and SEQ ID NOs: 37-44, and at least one non-polypeptide 
moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 or 2 non- 
polypeptide moieties attached to the polypeptide. It will be understood that the conjugate 
also exhibits an interferon-alpha activity (such as, antiviral activity, TrI differentiation 
activity, and/or antiproliferative activity). 

20 In another aspect the conjugate comprises a polypeptide exhibiting an interferon- 

alpha activity which comprises an amino acid sequence which differs in 1 to 16 amino 
acid positions from SEQ ID NO: 1 and comprises one or more substitutions relative to 
SEQ ID NO: 1 selected from the group consisting of F48A/L; V51P; F55A; F65A; F68P; 
LI 1 1A; and VI 14P; and at least one non-polypeptide moiety attached to the polypeptide, 

25 such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 or 2 non-polypeptide moieties attached to the 

polypeptide. In some instances, the amino acid sequence further comprises one or more 
substitutions which introduces an attachment group for the non-polypeptide moiety (e.g., 
by substitution of an amino acid residue for a different residue which comprises an 
attachment group for the non-polypeptide moiety, or by insertion of an additional amino 

30 acid residue which comprises an attachment group for the non-polypeptide moiety). It 
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will be understood that the conjugate also exhibits an interferon-alpha activity (such as, 
antiviral activity, T H 1 differentiation activity, and/or antiproliferative activity). 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 1 to 1 6 amino acid positions from SEQ ID 
5 NO: 1 and (b) comprises the substitution F48A/L relative to SEQ ID NO:l, and at least 
one non-poly peptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, 
e.g. 1 or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
comprise a polypeptide sequence which differs from SEQ ID NO: 1 in 1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 

10 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 
1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. Some such conjugates comprise the substitution F48A relative to 

15 SEQ ID NO: 1. In some instances, the polypeptide part of the conjugate further comprises 
one or more substitution(s) relative to SEQ ID NO:l selected from the group consisting of 
H47Q; V51A/P; Q52P/E; A53T; F55A/S; L56V; F65A; F57L; Y58H; M61I; F68P; 
LI 1 1A; Nl 13K; VI 14E/P; and E160D. Some such conjugates exhibit an interferon- 
alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or antiproliferative 

20 activity). Some such conjugates further comprise one or more additional amino acid(s), 
such as a methionine added to the N-terminus of the polypeptide. 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO: 1 and (b) comprises the substitution F55A relative to SEQ ID NO: 1, and at least one 

25 non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 
or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
comprise a polypeptide sequence which differs from SEQ ID NO: 1 in 1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 
1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 

30 11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 



WO 2005/113592 



PCT/US2005/0 17471 



amino acid positions. In some instances, the polypeptide part of the conjugate further 
comprises one or more substitution(s) relative to SEQ ID NO: 1 selected from the group 
consisting of H47Q; F48A/L; V51A/P; Q52P/E; A53T; L56V; F65A; F57L; Y58H; 
M61I; F68P; L111A; N113K; V114E/P; and E160D. Some such conjugates exhibit an 
5 interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or 
antiproliferative activity). Some such conjugates further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus of the 
polypeptide. 

The invention also provides conjugates which each comprise a polypeptide 

10 comprising a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO: 1 and (b) comprises the substitution F65A relative to SEQ ID NO: 1, and at least one 
non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 
or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
comprise a polypeptide sequence which differs from SEQ ID NO:l in 1, 2, 3, 4, 5, 6, 7, 8, 

15 9, 10, 1 1, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 
1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 
1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 

20 amino acid positions. In some instances, the polypeptide part of the conjugate further 
comprises one or more substitutions relative to SEQ ID NO:l selected from the group 
consisting of H47Q; F48A/L; V51A/P; Q52P/E; A53T; F55A/S; L56V; F57L; Y58H; 
M61I; F68P; LI 1 1A; Nl 13K; VI 14E/P; and E160D. Some such conjugates exhibit an 
interferon-alpha activity (e.g., antiviral activity, T H 1 differentiation activity, and/or 

25 antiproliferative activity). Some such conjugates further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus of the 
polypeptide. 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 1 to 1 6 amino acid positions from SEQ ID 
30 NO: 1 and (b) comprises the substitution LI 1 1 A relative to SEQ ID NO: 1, and at least one 
non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 
or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
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comprise a polypeptide sequence which differs from SEQ ID NO:l in 1,2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 
1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 
1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
5 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide part of the conjugate further 
comprises one or more substitutions relative to SEQ ED NO:l selected from the group 
consisting of H47Q; F48A/L; V51A/P; Q52P/E; A53T; F55A/S; L56V; F65A; F57L; 
10 Y58H; M61I; F68P; Nl 13K; VI14E/P; and E160D. Some such conjugates exhibit an 
interferon-alpha activity (e.g., antiviral activity, ThI differentiation activity, and/or 
antiproliferative activity). Some such conjugates further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus of the 
polypeptide. 

15 The invention also provides conjugates which each comprise a polypeptide 

comprising a sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID 
NO: 13 and (b) comprises one or more of: Ala or Leu at position 48; Pro at position 51; 
Ala at position 55; Ala at position 65; Pro at position 68; Ala at position 111; and Pro at 
position 1 14 (position numbering relative to SEQ ID NO: 13), and at least one non- 
20 polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 or 2 
non-polypeptide moieties attached to the polypeptide. Some such conjugates comprise a 
polypeptide comprising a sequence which differs from SEQ ID NO: 13 in 0, 1, 2, 3, 4, 5, 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 0-15 amino acid 
positions, in 0-14 amino acid positions, in 0-13 amino acid positions, in 0-12 amino acid 
25 positions, in 0-1 1 amino acid positions, in 0-10 amino acid positions, in 0-9 amino acid 
positions, in 0-8 amino acid positions, in 0-7 amino acid positions, in 0-6 amino acid 
positions, in 0-5 amino acid positions, in 0-4 amino acid positions, in 0-3 amino acid 
positions, in 0-2 amino acid positions, or in 0-1 amino acid positions. Some such 
conjugates comprise a polypeptide sequence which comprises an Ala or a Leu at position 
30 48. Some such conjugates comprise a polypeptide sequence which comprises an Ala at 
position 48. In some instances, the polypeptide part of the conjugate further comprises 
one or more substitution(s) relative to SEQ ID NO: 13 selected from the group consisting 
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of H47Q; V51A; Q52P/E; A53T; F55S; L56V; F57L; Y58H; M61I; N113K; V114E; and 
E160D. Some such conjugates exhibit an interferon-alpha activity (e.g., antiviral 
activity, T H 1 differentiation activity, and/or antiproliferative activity). Some such 
conjugates comprise a polypeptide sequence further comprising one or more additional 
5 amino acid(s), such as a methionine added to the N-terminus. 

In another aspect the conjugate comprises a polypeptide exhibiting an interferon- 
alpha activity which comprises an amino acid sequence which differs in 1 to 16 amino 
acid positions from SEQ ID NO: 36 and comprises one or more substitutions relative to 
SEQ ID NO:36 selected from the group consisting of M21A; I24P; F48A/L; T51P; S55A; 

10 F65A; F68P; F90A; M93P; LI 1 1 A; VI 14P; F124A; I127P; and E160D; and at least one 
non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 
or 2 non-polypeptide moieties attached to the polypeptide. In some instances, the amino 
acid sequence further comprises one or more substitutions which introduces an 
attachment group for the non-polypeptide moiety (e.g., by substitution of an amino acid 

] 5 residue for a different residue which comprises an attachment group for the non- 
polypeptide moiety, or by insertion of an additional amino acid residue which comprises 
an attachment group for the non-polypeptide moiety). 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 

20 NO:36 and (b) comprises the substitution F48A/L relative to SEQ ID NO:36, and at least 
one non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, 
e.g. 1 or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
comprise a polypeptide sequence which differs from SEQ ID NO: 36 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 

25 in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. Some such conjugates comprise the substitution F48A relative to 

30 SEQ ID NO:36. In some instances, the polypeptide part of the conjugate further 

comprises one or more substitution(s) relative to SEQ LD NO: 36 selected from the group 
consisting of M21A; I24P; P26L; H47Q; T51V/P; S55P/F/A; V56L; H58Y; L60M; 
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F65A; F68P; F90Y/A; M93L/P; N95D; L111A; N113K; V114E/P; F124A; R125Q; 
I127P; T132K; L154F; and E160D. Some such conjugates exhibit an interferon-alpha 
activity (e.g., antiviral activity, ThI differentiation activity, and/or antiproliferative 
activity). Some such conjugates further comprise one or more additional amino acid(s), 

5 such as a methionine added to the N-terminus of the polypeptide. 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ED 
NO:36 and (b) comprises the substitution F90A relative to SEQ ID NO: 36, and at least 
one non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, 

10 e.g. 1 or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 
comprise a polypeptide sequence which differs from SEQ ID NO:36 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 

15 8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide part of the conjugate further 
comprises one or more substitution(s) relative to SEQ ED NO:36 selected from the group 
consisting of M21A; I24P; P26L; H47Q; F48A/L; T51V/P; S55P/F/A; V56L; H58Y; 

20 L60M; F65A; F68P; M93L/P; N95D; L111A; N113K; V114E/P; F124A; R125Q; I127P; 
T132K; L154F; and E160D. Some such conjugates exhibit an interferon-alpha activity 
(e.g., antiviral activity, T H 1 differentiation activity, and/or antiproliferative activity). 
Some such conjugates further comprise one or more additional amino acid(s), such as a 
methionine added to the N-terminus of the polypeptide. 

25 The invention also provides conjugates which each comprise a polypeptide 

comprising a sequence which (a) differs in 1 to 16 amino acid positions from SEQ ID 
NO:36 and (b) comprises the substitution E160D relative to SEQ ED NO:36, and at least 
one non-polypeptide moiety attached to the polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, 
e.g. 1 or 2 non-polypeptide moieties attached to the polypeptide. Some such conjugates 

30 comprise a polypeptide sequence which differs from SEQ ID NO:36 in 1, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, 
in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 
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1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1- 
8 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions. In some instances, the polypeptide part of the conjugate further 
5 comprises one or more substitution(s) relative to SEQ ID NO:36 selected from the group 
consisting of M21A; I24P; P26L; H47Q; F48A/L; T51V/P; S55P/F/A; V56L; H58Y; 
L60M; F65A; F68P; F90Y/A; M93L/P; N95D; L111A; N113K; V114E/P; F124A; 
R125Q; I127P; T132K; and L154F. Some such conjugates exhibit an interferon-alpha 
activity (e.g., antiviral activity, T H 1 differentiation activity, and/or antiproliferative 

10 activity). Some such conjugates further comprise one or more additional amino acid(s), 
such as a methionine added to the N-terminus of the polypeptide. 

The invention also provides conjugates which each comprise a polypeptide 
comprising a sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID 
NO:38 and (b) comprises one or more of: Ala at position 21 ; Pro at position 24; Ala or 

15 Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; Pro at 
position 68; Ala at position 90; Pro at position 93; Ala at position 111; Pro at position 
1 14; Ala at position 124; Pro at position 127; and Glu at position 160 (position numbering 
relative to SEQ ID NO:38), and at least one non-polypeptide moiety attached to the 
polypeptide, such as e.g., 1-6, 1-5, 1-4, 1-3, e.g. 1 or 2 non-polypeptide moieties attached 

20 to the polypeptide. Some such conjugates comprise a polypeptide comprising a sequence 
which differs from SEQ ID NO:38 in 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 
16 amino acid positions, e.g., in 0-15 amino acid positions, in 0-14 amino acid positions, 
in 0-13 amino acid positions, in 0-12 amino acid positions, in 0-1 1 amino acid positions, 
in 0-10 amino acid positions, in 0-9 amino acid positions, in 0-8 amino acid positions, in 

25 0-7 amino acid positions, in 0-6 amino acid positions, in 0-5 amino acid positions, in 0-4 
amino acid positions, in 0-3 amino acid positions, in 0-2 amino acid positions, or in 0-1 
amino acid positions. Some such conjugates comprise a polypeptide sequence which 
comprises an Ala or a Leu at position 48. Some such conjugates comprise a polypeptide 
sequence which comprises an Ala at position 48. In some instances, the polypeptide part 

30 of the conjugate further comprises one or more substitution(s) relative to SEQ ID NO:38 
selected from the group consisting of P26L; H47Q; V51T; F55P/S; L56V; Y58H; L60M; 
F90Y; M93L; N95D; N113K; V114E; R125Q; T132K; and F154L. Some such 
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conjugates exhibit an interferon-alpha activity (e.g., antiviral activity, ThI differentiation 
activity, and/or antiproliferative activity). Some such conjugates comprise a polypeptide 
sequence further comprising one or more additional amino acid(s), such as a methionine 
added to the N-terminus. 

5 The term "conjugate" (or interchangeably "polypeptide conjugate" or "conjugated 

polypeptide") is intended to indicate a heterogeneous (in the sense of composite) 
molecule formed by the covalent attachment of one or more polypeptides of the invention 
to one or more non-polypeptide moieties. The term "covalent attachment" means that the 
polypeptide and the non-polypeptide moiety are either directly covalently joined to one 

10 another, or else are indirectly covalently joined to one another through an intervening 
moiety or moieties, such as a bridge, spacer, or linkage moiety or moieties. Preferably, a 
conjugated polypeptide is soluble at relevant concentrations and conditions, i.e. soluble in 
physiological fluids such as blood. Examples of conjugated polypeptides of the invention 
include glycosylated and/or PEGylated polypeptides. The term "non-conjugated 

15 polypeptide" may be used to refer to the polypeptide part of the conjugated polypeptide. 

The term "non-polypeptide moiety" is intended to mean a molecule that is capable of 
conjugating to an attachment group of the polypeptide. Preferred examples of non- 
polypeptide moieties include polymer molecules, sugar moieties, lipophilic compounds, 
or organic derivatizing agents, in particular polymer molecules or sugar moieties. It will 

20 be understood that the non-polypeptide moiety is linked to the polypeptide through an 
attachment group of the polypeptide. Except where the number of non-polypeptide 
moieties, such as polymer molecule(s), attached to the polypeptide is expressly indicated, 
every reference to "a non-polypeptide moiety" attached to the polypeptide or otherwise 
used in the present invention shall be a reference to one or more non-polypeptide moieties 

25 attached to the polypeptide. 

The term "polymer molecule" is defined as a molecule formed by covalent linkage 
of two or more monomers, wherein none of the monomers is an amino acid residue. The 
term "polymer" may be used interchangeably with the term "polymer molecule". 

The term "sugar moiety" is intended to indicate a carbohydrate molecule attached by 

30 in vivo or in vitro glycosylation, such as N- or O-glycosylation. 
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An "N-glycosylation site" has the sequence N-X-S/T/C, wherein X is any amino acid 
residue except proline, N is asparagine and S/T/C is either serine, threonine or cysteine, 
preferably serine or threonine, and most preferably threonine. 

An "O-glycosylation site" comprises the OH-group of a serine or threonine residue. 
5 The term "attachment group" is intended to indicate an amino acid residue group 

capable of coupling to the relevant non-polypeptide moiety such as a polymer molecule 
or a sugar moiety. Non-limiting examples of useful attachment groups and some 
corresponding non-polypeptide moieties are provided in Table 4 below. 

10 Table 4 



Useful attachment groups and examples of corresponding non-polypeptide moieties 



Attachment 
group 


Amino acid 


Examples of non- 
polypeptide moieties 


Examples of 
conjugation 
method / activated 
PEG 


Reference 


-NH 2 


N-terminus, Lys 


Polymer, e.g. PEG 


mPEG-SPA 

mPEG2-NHS 

mPEG2- 

butryALD 


Nektar Inc. 2003 
Catalog 


-COOH 


C-terminus, Asp, 
Glu 


Polymer, e.g. PEG 
Sugar moiety 


mPEG-Hz 

In vitro coupling 


Nektar Inc. 2003 
Catalog 


-SH 


Cys 


Polymer, e.g. PEG, 
Sugar moiety 


mPEG-VS 
mPEG2-MAL 

In vitro coupling 


Nektar Inc. 2003 
Catalog; 
Delgado et al, 
Critical Reviews 
in Therapeutic 
Drug Carrier 
Systems 
9(3,4):249-304 
(1992) 


-OH 


Ser, Thr, OH- 


Sugar moiety 


In vivo O-l inked 
glycosylation 




-CONH 2 


Asn as part of an 

N-glycosylation 

site 


Sugar moiety 


In vivo N- 
glycosylation 




Aromatic 
residue 


Phe, Tyr, Trp 


Sugar moiety 


In vitro coupling 





61 



WO 2005/113592 



PCT/US2005/0 17471 



Attachment 
group 


Amino acid 


Examples of non- 
polypeptide moieties 


Examples of 
conjugation 
method / activated 
PEG 


Reference 


-CONH 2 


Gin 


Sugar moiety 


In vitro coupling 


Yan and Wold, 
Biochemistry, 
1984, Jul 31; 
23(16): 3759-65 


Aldehyde 
Ketone 


Oxidized 
carbohydrate 


Polymer, e.g. PEG, 
PEG-hydrazide 


PEGylation 


Andresz et al., 
1978, Makromol. 
Chem. 179:301; 
WO 92/16555, 
WO 00/23114 


Guanidino 


Arg 


Sugar moiety 


In vitro coupling 


Lundblad and 
Noyes, Chemical 
Reagents for 
Protein 
Modification, 
CRC Press Inc. 
Boca Raton, FI 


Imidazole ring 


His 


Sugar moiety 


In vitro coupling 


As for guanidine 



For in vivo N-glycosylation, the term "attachment group" is used in an 
unconventional way to indicate the amino acid residues constituting an N-glycosylation 
site (with the sequence N-X-S/T/C, wherein X is any amino acid residue except proline, 
5 N is asparagine and S/T/C is either serine, threonine or cysteine, preferably serine or 
threonine, and most preferably threonine). Although the asparagine residue of the N- 
glycosylation site is the one to which the sugar moiety is attached during glycosylation, 
such attachment cannot be achieved unless the other amino acid residues of the N- 
glycosylation site is present. Accordingly, when the non-polypeptide moiety is a sugar 

10 moiety and the conjugation is to be achieved by N-glycosylation, the term "amino acid 
residue comprising an attachment group for the non-polypeptide moiety" as used in 
connection with alterations of the amino acid sequence of the polypeptide of the invention 
is to be understood as one, two or all of the amino acid residues constituting an N- 
glycosylation site is/are to be altered in such a manner that either a functional N- 

1 5 glycosylation site is introduced into the amino acid sequence, removed from said 

sequence or a functional N-glycosylation site is retained in the amino acid sequence (e.g. 
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by substituting a serine residue, which already constitutes part of an N-glycosylation site, 
with a threonine residue and vice versa). 

The term 'introduce" (i.e., an "introduced" amino acid residue, "introduction" of an 
amino acid residue) is primarily intended to mean substitution of an existing amino acid 
5 residue for another amino acid residue, but may also mean insertion of an additional 
amino acid residue. 

The term "remove" (i.e., a "removed" amino acid residue, "removal" of an amino 
acid residue) is primarily intended to mean substitution of the amino acid residue to be 
removed for another amino acid residue, but may also mean deletion (without 

10 substitution) of the amino acid residue to be removed. 

The term "amino acid residue comprising an attachment group for the non- 
polypeptide moiety" is intended to indicate that the amino acid residue is one to which the 
non-poly peptide moiety binds (in the case of an introduced amino acid residue) or would 
have bound (in the case of a removed amino acid residue). 

15 The term "functional in vivo half-life" is used in its normal meaning, i.e. the time at 

which 50% of the biological activity of the polypeptide is still present in the body/target 
organ, or the time at which the activity of the polypeptide is 50% of the initial value. The 
functional in vivo half-life may be determined in an experimental animal, such as rat, 
mice, rabbit, dog or monkey. Preferably, the functional in vivo half -life is determined in a 

20 non-human primate, such as a monkey. Furthermore, the functional in vivo half-life may 
be determined for a sample that has been administered intravenously or subcutaneously. 

As an alternative to determining functional in vivo half-life, "serum half-life" may be 
determined, i.e. the time at which 50% of the polypeptide circulates in the plasma or 
bloodstream prior to being cleared. Determination of serum half-life is often more simple 

25 than determining the functional in vivo half-life and the magnitude of serum half-life is 
usually a good indication of the magnitude of functional in vivo half-life. Alternatively 
terms to serum half-life include "plasma half-life", "circulating half-life", "serum 
clearance", "plasma clearance" and "clearance half-life". The serum half-life may be 
determined as described above in connection with determination of functional in vivo 

30 half-life. 

The term "serum" is used in its normal meaning, i.e. as blood plasma without 
fibrinogen and other clotting factors. 
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The term "increased" as used about the functional in vivo half-life or serum half-life 
is used to indicate that the relevant half-life of the conjugate of the invention is 
statistically significantly increased relative to that of a reference molecule, such as a wild- 
type interferon-alpha, e.g., a human interferon-alpha, such as one of SEQ ID NO:45-SEQ 
5 ID NO:56 (or other huIFN-alpha sequences as described herein and/or in Allen G. and 
Diaz M.O. (1996), supra), or the corresponding non-conjugated polypeptide. Thus, 
interesting conjugates of the invention include those which have an increased functional 
in vivo half-life or an increased serum half -life as compared to a reference molecule 
mentioned above. 

10 The term "AUC SC " or "Area Under the Curve when administered subcutaneously" is 

used in its normal meaning, i.e. as the area under the interferon-alpha-activity-in-serum 
vs. time curve, where the conjugated molecule has been administered subcutaneously to 
an experimental animal. Once the experimental interferon-alpha activity time points have 
been determined, the AUC SC may conveniently be calculated by a computer program, such 

15 as GraphPad Prism 3.01. 

The term "increased" as used about the AUC SC is used to indicate that the Area 
Under the Curve for a conjugate of the invention, when administered subcutaneously, is 
statistically significantly increased relative to that of a reference molecule, such as wild- 
type interferon-alpha, e.g., a human interferon-alpha, such as one of SEQ ID NO:45-SEQ 

20 ID NO:56 (or other huIFN-alpha sequences as described herein and/or in Allen G. and 
Diaz M.O. (1996), supra), or the corresponding non-conjugated polypeptide, when 
determined under comparable conditions. Evidently, the same amount of interferon-alpha 
activity should be administered for the conjugate of the invention and the reference 
molecule. Consequently, in order to make direct comparisons between different 

25 interferon-alpha molecules, the AUC SC values should typically be normalized, i.e. be 
expressed as AUC sc /dose administered. 

The term "T maXjSC " is used about the time point in the interferon-alpha-activity-in- 
serum vs. time curve where the highest interferon-alpha activity in serum is observed. 
It will be understood that while the examples and modifications to the parent 

30 polypeptide are generally provided herein in regards to the sequence SEQ ID NO: 1, the 
disclosed modifications may also be made in equivalent amino acid positions of any of 
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the other polypeptides of the invention (including SEQ ID NOs:2-35 and 37-44 and 
variants thereof) described above. 

By removing and/or introducing amino acid residues comprising an attachment 
group for the non-polypeptide moiety it is possible to specifically adapt the polypeptide 
5 so as to make the molecule more susceptible to conjugation to the non-polypeptide 
moiety of choice, to optimize the conjugation pattern (e.g. to ensure an optimal 
distribution of non-polypeptide moieties on the surface of the interferon-alpha molecule 
and thereby, e.g., effectively shield epitopes and other surface parts of the polypeptide 
without significantly impairing the function thereof). For instance, by introduction of 

10 attachment groups, the interferon-alpha polypeptide is altered in the content of the 
specific amino acid residues to which the relevant non-polypeptide moiety binds, 
whereby a more efficient, specific and/or extensive conjugation is achieved. By removal 
of one or more attachment groups it is possible to avoid conjugation to the non- 
polypeptide moiety in parts of the polypeptide in which such conjugation is 

15 disadvantageous, e.g. to an amino acid residue located at or near a functional site of the 
polypeptide (since conjugation at such a site may result in inactivation or reduced 
interferon- alpha activity of the resulting conjugate due to impaired receptor recognition). 
Further, it may be advantageous to remove an attachment group located close to another 
attachment group. 

20 It will be understood that the amino acid residue comprising an attachment group for 

a non-polypeptide moiety, whether it be removed or introduced, is selected on the basis of 
the nature of the non-polypeptide moiety and, in some instances, on the basis of the 
conjugation method to be used. For instance, when the non-polypeptide moiety is a 
polymer molecule, such as a polyethylene glycol or polyalkylene oxide derived molecule, 

25 amino acid residues capable of functioning as an attachment group may be selected from 
the group consisting of cysteine, lysine (and/or the N-terminal amino group of the 
polypeptide), aspartic acid, glutamic acid, histidine and arginine. When the non- 
polypeptide moiety is a sugar moiety, the attachment group is an in vivo or in vitro N- or 
O-glycosylation site, preferably an N-glycosylation site. 

30 In some instances, when an attachment group for a non-polypeptide moiety is to be 

introduced into or removed from the interferon-alpha polypeptide, the position of the 
interferon-alpha polypeptide to be modified may be conveniently selected as follows: 
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The position to be modified may be located at the surface of the interferon-alpha 
polypeptide, such as a position occupied by an amino acid residue which has more than 
25% of its side chain exposed to the solvent, such as more than 50% of its side chain 
exposed to the solvent. Such positions have been identified on the basis of an analysis of 
5 a 3D structure of the human interferon-alpha 2a molecule as described in the "Materials 
and Methods" section herein. 

In order to determine an optimal distribution of attachment groups, the distance 
between amino acid residues located at the surface of the interferon-alpha molecule was 
calculated on the basis of a 3D structure of an interferon-alpha polypeptide. More 

10 specifically, the distance between the CB's of the amino acid residues comprising such 
attachment groups, or the distance between the functional group (NZ for lysine, CG for 
aspartic acid, CD for glutamic acid, SG for cysteine) of one and the CB of another amino 
acid residue comprising an attachment group were determined. In case of glycine, CA 
was used instead of CB. In the interferon-oc polypeptide part of a conjugate of the 

15 invention, any of said distances may be more than 8 A, such as more than 10 A in order to 
avoid or reduce heterogeneous conjugation and to provide a uniform distribution of 
attachment groups, e.g. with the aim of epitope shielding. 

Furthermore, in the interferon-alpha polypeptide part of a conjugate of the invention, 
in some instances attachment groups located at or near the receptor binding sites of 

20 interferon-alpha are removed, such as by substitution of the amino acid residue 

comprising such group. In some instances, amino acid residues comprising an attachment 
group for a non-polypeptide moiety, such as cysteine or lysine, are often not introduced at 
or near the receptor binding site of the interferon alpha molecule. 

Another approach for modifying an interferon-alpha polypeptide is to shield and 

25 thereby modify or destroy or otherwise inactivate an epitope present in the parent 

interferon-alpha, by conjugation to a non-polypeptide moiety. Epitopes of interferon- 
alpha polypeptides may be identified by use of methods known in the art, also known as 
epitope mapping, see e.g. Romagnoli et al., J. Biol Chem., 1999, 380(5):553-9, DeLisser 
HM, Methods Mol Biol, 1999, 96: 1 1-20, Van de Water et al., Clin Immunol 

30 ImmunopathoU 1997, 85(3):229-35, Saint-Remy JM, Toxicology, 1997, 1 19(1):77-81, and 
Lane DP and Stephen CW, Curr Opin Immunol, 1993, 5(2):268-71. One method is to 
establish a phage display library expressing random oligopeptides of, e.g., 9 amino acid 

66 



WO 2005/113592 



PCT/US2005/017471 



residues. IgGl antibodies from specific antisera towards human interferon-alpha are 
purified by immunoprecipitation and the reactive phages are identified by 
immunoblotting. By sequencing the DNA of the purified reactive phages, the sequence of 
the oligopeptide can be determined followed by localization of the sequence on the 3D- 
5 structure of the interferon-alpha. Alternatively, epitopes can be identified according to the 
method described in US Patent 5,041,376. The thereby identified region on the structure 
constitutes an epitope that then can be selected as a target region for introduction of an 
attachment group for the non-polypeptide moiety. Preferably, at least one epitope, such as 
two, three or four epitopes of interferon-alpha are shielded by a non-polypeptide moiety 

10 according to the present invention. Accordingly, in one aspect, the conjugate of the 

invention has at least one shielded epitope as compared to a wild type human interferon- 
alpha, including any commercially available interferon- alpha. This may be done by 
introduction of an attachment group for a non-polypeptide moiety into a position located 
in the vicinity of (i.e. within 4 amino acid residues in the primary sequence or within 

15 about 10 A in the tertiary sequence) of a given epitope. The 10 A distance is measured 
between CB's (CA's in case of glycine). Such specific introductions are described in the 
following sections. 

In case of removal of an attachment group, the relevant amino acid residue 
comprising such group and occupying a position as defined above may be substituted 

20 with a different amino acid residue that does not comprise an attachment group for the 
non-polypeptide moiety in question, or may be deleted. Removal of an N-glycosylation 
group, may also be accomplished by insertion or removal of an amino acid reside within 
the motif N-X-S/T/C. 

In case of introduction of an attachment group, an amino acid residue comprising 

25 such group is introduced into the position, such as by substitution of the amino acid 
residue occupying such position. 

The exact number of attachment groups available for conjugation and present in the 
interferon-alpha polypeptide is dependent on the effect desired to be achieved by 
conjugation. The effect to be obtained is, e.g., dependent on the nature and degree of 

30 conjugation (e.g. the identity of the non-polypeptide moiety, the number of non- 
polypeptide moieties desirable or possible to conjugate to the polypeptide, where they 
should be conjugated or where conjugation should be avoided, etc.). For instance, if 
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reduced immunogenicity is desired, the number (and location of) attachment groups 
should be sufficient to shield most or all epitopes. This is normally obtained when a 
greater proportion of the interferon-alpha polypeptide is shielded. Effective shielding of 
epitopes is normally achieved when the total number of attachment groups available for 

5 conjugation is in the range of 1-6 attachment groups, e.g., 1-5, such as in the range of 1- 
3, such as 1, 2, or 3 attachment groups. 

Functional in vivo half-life is i.a. dependent on the molecular weight of the 
conjugate, and the number of attachment groups needed for providing increased half-life 
thus depends on the molecular weight of the non-polypeptide moiety in question. Some 

10 such conjugates comprise 1-6, e.g., 1-5, such as 1-3, e.g. 1, 2, or 3 non-polypeptide 

moieties each having a MW of about 2-40 kDa, such as about 2 kDa, about 5 kDa, about 
12 kDa, about 15 kDa, about 20 kDa, about 30 kDa, or about 40 kDa. 

In the conjugate of the invention, some, most, or substantially all conjugatable 
attachment groups are occupied by the relevant non-polypeptide moiety. 

15 The conjugate of the invention may exhibit one or more of the following improved 

properties: 

For example, the conjugate may exhibit a reduced immunogenicity as compared to a 
human interferon-alpha (such as any of the polypeptides defined herein as SEQ ID 
NO:45-57, or any other huIFN-alpha described herein and/or in Allen G. and Diaz M.O. 

20 (1996), supra) or as compared to the corresponding non-conjugated polypeptide, e.g. a 
reduction of at least 10%, such as a reduction of at least of 25%, such as a reduction of at 
least of 50%, e.g. a reduction of at least 75% compared to the non-conjugated polypeptide 
or compared to a human interferon-alpha. 

In another aspect the conjugate may exhibit a reduced reaction or no reaction with 

25 neutralizing antibodies from patients treated with a human interferon-alpha (such as any 
of the polypeptides defined herein as SEQ ID NO: 45-57, or any other huIFN-alpha 
described herein and/or in Allen G. and Diaz M.O. (1996), supra) or as compared to the 
corresponding non-conjugated polypeptide, e.g. a reduction of neutralisation of at least 
10%, such as at least of 25%, such as of at least 50%, e.g., at least 75%. 

30 In another aspect of the invention the conjugate may exhibit an increased functional 

in vivo half-life and/or increased serum half-life as compared to a reference molecule 
such as a human interferon-alpha (e.g. any of the polypeptides defined herein as SEQ ID 
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NO:45-57 or any other huIFN-alpha described herein and/or in Allen G. and Diaz M.O. 
(1996), supra) or as compared to the corresponding non-conjugated polypeptide. 
Particular preferred conjugates are such conjugates where the ratio between the functional 
in vivo half-life (or serum half-life) of said conjugate and the functional in vivo half-life 
5 (or serum half-life) of said reference molecule is at least 1.25, such as at least 1.50, such 
as at least 1.75, such as at least 2, such as at least 3, such as at least 4, such as at least 5, 
such as at least 6, such as at least 7, such as at least 8. As mentioned above, the half-life is 
conveniently determined in an experimental animal, such as rat or monkey, and may be 
based on intravenously or subcutaneously administration. 
10 In a further aspect the conjugate may exhibit an increased bioavailability as 

compared to a reference molecule such as a human interferon-alpha (e.g. any of the 
polypeptides defined herein as SEQ ID NO:45-57, or any other huIFN-alpha described 
herein and/or in Allen G. and Diaz M.O. (1996), supra) or the corresponding non- 
conjugated polypeptide. For example, the conjugate may exhibit an increased AUC SC as 
15 compared to a reference molecule such as a human interferon-alpha or the corresponding 
non-conjugated polypeptide. Thus, exemplary conjugates are such conjugates where the 
ratio between the AUC SC of said conjugate and the AUC SC of said reference molecule is at 
least 1.25, such as at least 1.5, such as at least 2, such as at least 3, such as at least 4, such 
as at least 5 or at least 6, such as at least 7, such as at least 8, such as at least 9 or at least 
20 10, such as at least 12, such as at least 14, e.g. at least 16, at least 18 or at least 20 when 
administered subcutaneously, in particular when administered subcutaneously in an 
experimental animal such as rat or monkey. Analogously, some conjugates of the 
invention are such conjugates wherein the ratio between T max for said conjugate and T max 
for said reference molecule, such as a human interferon-alpha or the corresponding non- 
25 conjugated polypeptide, is at least 1.2, such as at least 1.4, e.g. at least 1.6, such as at least 
1.8, such as at least 2, e.g. at least 2.5, such as at least 3, such as at least 4, e.g. at least 5, 
such as at least 6, such as at least 7, e.g. at least 8, such as at least 9, such as at least 10, 
when administered subcutaneously, in particular when administered subcutaneously in an 
experimental animal such as rat or monkey. 
30 In some instances, the magnitude of the antiviral activity of a conjugate of the 

invention may be reduced (e.g. by at least about 75%, at least about 50%, at least about 
25%, at least about 10%) or increased (e.g. by at least about 10 %) or is about equal (e.g. 
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within about +/- 10% or about +/- 5%) to that of a human interferon-alpha (e.g. any of 
the polypeptides identified herein as SEQ ID NO:45-57, or any other huIFN-alpha 
described herein and/or in Allen G. and Diaz M.O. (1996), supra) or to that of the 
corresponding non-conjugated polypeptide. In some instances the degree of antiviral 
5 activity as compared to antiproliferative activity of a conjugate of the invention may vary, 
and thus be higher, lower or about equal to that of a human interferon-alpha or to that of 
the corresponding non-conjugated polypeptide. 

Conjugate of the invention where the non-polypeptide moiety binds to a lysine residue or 
the N-terminal amine 

10 In one aspect, the invention relates to a conjugate exhibiting an interferon-alpha 

activity and comprising at least one non-polypeptide moiety conjugated to at least one 
lysine residue and/or to the N-terminal amino group of an interferon-alpha polypeptide 
comprising a sequence selected from SEQ ID NOs:2-35 and 37-44. Some such conjugates 
comprise a non-polypeptide moiety attached to a lysine residue selected from K31, K50, 

15 K71, K84, K122, K134, K 135, and K 165; such as, for example, a lysine residue 
selected from K31, K122, K135, and K165. Some such conjugates comprise a non- 
polypeptide moiety attached to a lysine residue selected from K31, K122, and K135. 
Some such conjugates comprise a non-polypeptide moiety attached to a lysine residue 
selected from K122 and K135. Some such conjugates comprise a non-polypeptide moiety 

20 attached to a lysine residue selected from K31 and K122; or K31 and K135. 

In another aspect, the invention relates to a conjugate exhibiting an interferon-alpha 
activity and comprising at least one non-polypeptide moiety conjugated to at least one 
lysine residue, or to the N-terminal amino group, of an interferon-alpha polypeptide 
comprising a sequence which (a) differs in 1 to 16 amino acid positions (such as in 1-15 

25 amino acid positions, in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 
amino acid positions, 1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 
amino acid positions, in 1-8 amino acid positions, in 1-7 amino acid positions, in 1-6 
amino acid positions, in 1-5 amino acid positions, in 1-4 amino acid positions, in 1-3 
amino acid positions, or in 1-2 amino acid positions) from SEQ ID NO: 1 and (b) 

30 comprises one or more substitutions relative to SEQ ID NO:l selected from the group 
consisting of F48A/L; V51P; F55A; F65A; F68P; L111A; and V114P. Some such 
conjugates comprise a polypeptide sequence further comprising one or more additional 
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amino acid(s), such as a methionine added to the N-terminus. Some conjugates according 
to this aspect comprise at least one removed lysine residue and/or at least one removed 
histidine residue, and/or at least one introduced lysine residue. 

The invention also provides a conjugate exhibiting an interferon-alpha activity and 
5 comprising at least one non-polypeptide moiety conjugated to at least one lysine residue, 
or to the N-terminal amino group, of an interferon-alpha polypeptide comprising a 
sequence which (a) differs in 0 to 16 amino acid positions from SEQ ID NO: 13 (such as 
in 0-15 amino acid positions, in 0-14 amino acid positions, in 0-13 amino acid positions, 
in 0-12 amino acid positions, in 0-1 1 amino acid positions, in 0-10 amino acid positions, 

10 in 0-9 amino acid positions, in 0-8 amino acid positions, in 0-7 amino acid positions, in 
0-6 amino acid positions, in 0-5 amino acid positions, in 0-4 amino acid positions, in 0-3 
amino acid positions, in 0-2 amino acid positions, or in 0-1 amino acid positions) and (b) 
comprises one or more of: Ala or Leu at position 48; Pro at position 51; Ala at position 
55; Ala at position 65; Pro at position 68; Ala at position 111; and Pro at position 1 14 

15 (position numbering relative to SEQ ED NO: 13). Some such conjugates comprise a 
polypeptide sequence which comprises an Ala or a Leu at position 48. Some such 
conjugates comprise a polypeptide sequence which comprises an Ala at position 48. In 
some instances, the polypeptide part of the conjugate further comprises one or more 
substitution(s) relative to SEQ ID NO: 13 selected from the group consisting of H47Q; 

20 V51A; Q52P/E; A53T; F55S; L56V; F57L; Y58H; M61I; N113K; V114E; and E160D. 
Some such conjugates comprise a polypeptide sequence further comprising one or more 
additional amino acid(s), such as a methionine added to the N-terminus. Some conjugates 
according to this aspect comprise at least one removed lysine residue and/or at least one 
removed histidine residue, and/or at least one introduced lysine residue. 

25 In another aspect, the invention relates to a conjugate exhibiting an interferon-alpha 

activity and comprising at least one non-polypeptide moiety conjugated to at least one 
lysine residue, or to the N-terminal amino group, of an interferon-alpha polypeptide 
comprising a sequence which (a) differs in 1 to 16 amino acid positions (such as in 1-15 
amino acid positions, in 1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 

30 amino acid positions, 1-11 amino acid positions, in 1-10 amino acid positions, in 1-9 
amino acid positions, in 1-8 amino acid positions, in 1-7 amino acid positions, in 1-6 
amino acid positions, in 1-5 amino acid positions, in 1-4 amino acid positions, in 1-3 
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amino acid positions, or in 1-2 amino acid positions) from SEQ ID NO: 36 and (b) 
comprises one or more substitutions relative to SEQ ID NO: 36 selected from the group 
consisting of M21 A; I24P; F48A/L; T51P; S55A; F65A; F68P; F90A; M93P; LI 1 1 A; 
VI 14P; F124A; I127P; and E160D. Some such conjugates comprise a polypeptide 
5 sequence further comprising one or more additional amino acid(s), such as a methionine 
added to the N-terminus. Some conjugates according to this aspect comprise at least one 
removed lysine residue and/or at least one removed histidine residue, and/or at least one 
introduced lysine residue. 

The invention also provides a conjugate exhibiting an interferon-alpha activity and 

10 comprising at least one non-polypeptide moiety conjugated to at least one lysine residue, 
or to the N-terminal amino group, of an interferon-alpha polypeptide comprising a 
sequence which (a) differs in 0 to 16 amino acid positions (such as in 0-15 amino acid 
positions, in 0-14 amino acid positions, in 0-13 amino acid positions, in 0-12 amino acid 
positions, in 0-1 1 amino acid positions, in 0-10 amino acid positions, in 0-9 amino acid 

15 positions, in 0-8 amino acid positions, in 0-7 amino acid positions, in 0-6 amino acid 
positions, in 0-5 amino acid positions, in 0-4 amino acid positions, in 0-3 amino acid 
positions, in 0-2 amino acid positions, or in 0-1 amino acid positions) from SEQ ID 
NO:38 and (b) comprises one or more of: Ala at position 21; Pro at position 24; Ala or 
Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; Pro at 

20 position 68; Ala at position 90; Pro at position 93; Ala at position 111; Pro at position 

114; Ala at position 124; Pro at position 127; and Glu at position 160 (position numbering 
relative to SEQ ID NO:38). Some such conjugates comprise a polypeptide sequence 
which comprises an Ala or a Leu at position 48. Some such conjugates comprise a 
polypeptide sequence which comprises an Ala at position 48. In some instances, the 

25 polypeptide part of the conjugate further comprises one or more substitution(s) relative to 
SEQ ID NO:38 selected from the group consisting of P26L; H47Q; V51T; F55P/S; 
L56V; Y58H; L60M; F90Y; M93L; N95D; N113K; V114E; R125Q; T132K; and F154L. 
Some such conjugates comprise a polypeptide sequence further comprising one or more 
additional amino acid(s), such as a methionine added to the N-terminus. Some conjugates 

30 according to this aspect comprise at least one removed lysine residue and/or at least one 
removed histidine residue, and/or at least one introduced lysine residue. 
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Some conjugates of the invention comprise a polypeptide sequence comprising a 
substitution of an amino acid residue for a different amino acid residue, or a deletion of 
an amino acid residue, which removes one or more lysines, such as, one or more of K31, 
K50, K71, K84, K122, K133, K134, K135, and/or K165 from a polypeptide of the 
5 invention such as, for example, one of SEQ ID NOs:2-35 and 37-44. The one or more 
lysine residue(s) to be removed may be substituted with any other amino acid, may be 
substituted with an Arg (R) or Gin (Q), or may be deleted. 

In instances where amine -reactive conjugation chemistries are employed, it may be 
advantageous to avoid or to minimize the potential for conjugation to histidine residues. 

10 Therefore, some conjugates of the invention comprise a polypeptide sequence 

comprising a substitution or a deletion which removes one or more histidines, e.g., H7, 
HI 1, H34, and/or H47 (relative to SEQ ID NO:l) from any polypeptide sequence of the 
invention such as, for example, one of one of SEQ ID NOs:2-35 and 37-44. The one or 
more histidine residue(s) to be removed may be substituted with any other amino acid, 

15 may be substituted with an Arg (R) or Gin (Q), or may be deleted. Some such conjugates 
comprise the substitutions H34Q; H47Q; or H34Q+H47Q. 

Alternatively, or in addition, some conjugates of the invention comprise a 
polypeptide sequence comprising a modification which introduces a lysine into a position 
that is occupied in the parent sequence (e.g., one of SEQ ID NOs:2-35 and 37-44) by an 

20 amino acid residue that is exposed to the surface of the molecule, e.g., one that has at 
least 25%, such as at least 50% of its side chain exposed to the surface. Some such 
conjugates comprise a polypeptide sequence comprising one or more of the following 
substitutions, relative to SEQ ID NO:l, which introduces a lysine residue into a position 
which is predicted to be exposed at the surface of the molecule with more than a 25% 

25 fractional ASA: D2K, L3K, P4K, Q5K, T6K, H7K, S8K, L9K, G10K, R12K, R13K, 
M16K, A19K, Q20K, R22K, R23K, I24K, S25K, L26K, F27K, S28K, L30K, R33K, 
H34K, D35K, R37K, Q40K, E41K, E42K, D44K, N46K, H47K, Q49K, V51K, Q52K, 
E59K, Q62K, Q63K, N66K, S69K, T70K, D72K, S74K, A75K, D78K, E79K, T80K, 
L81K, E83K, I87K, F90K, Q91K, N94K, D95K, E97K, A98K, V100K, M101K, Q102K, 

30 E103K, V104K, G105K, E107K, E108K, T109K, PI 10K, LI 1 IK, Ml 12K, Nl 13K, 
VI 14K, Dl 15K, LI 18K, R121K, Q125K, R126K, T128K, L129K, T132K, E133K, 
Y136K, S137K, P138K, A146K, M149K, R150K, S153K, F154K, N157K, Q159K, 
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E160K, S161K, L162K, R163K, S164K, and E166K, said amino acid residue positions 
relative to SEQ ID NO: 1 : 

Some such conjugates of the invention comprise a polypeptide sequence comprising 
one or more of the following substitutions, relative to SEQ ID NO: 1, which introduces a 
5 lysine residue into a position which is predicted to be exposed at the surface of the 
molecule with more than a 50% fractional ASA: D2K, L3K, P4K, Q5K, T6K, H7K, 
S8K, L9K, R12K, R13K, M16K, A19K, S25K, F27K, S28K, R33K, H34K, D35K, 
R37K, E41K, D44K, N46K, H47K, Q49K, N66K, A75K, D78K, E79K, T80K, E83K, 
I87K, F90K, Q91K, N94K, D95K, M101K, Q102K, E103K, G105K, E107K, E108K, 

10 T109K, PI 10K, LI 1 IK, VI 14K, Dl 15K, LI 18K, R121K, Q125K, R126K, L129K, 

T132K, E133K, P138K, R150K, E160K, L162K, R163K, S164K and E166K, said amino 
acid residue positions relative to SEQ ID NO: 1. 

It is to be understood that while the examples of modifications to the parent 
polypeptide are generally provided herein relative to the sequence SEQ ID NO:l (or 

15 relative to some other specified sequence), the disclosed modifications may also be made 
in equivalent amino acid positions of the other polypeptides of the invention (including 
SEQ ID NOs:2-35 and 37-44 and variants thereof) described herein. Thus, as an 
example, the substitution H47K relative to SEQ ID NO: 1 is understood to correspond to 
Q47K in SEQ ID NO:37, and so on. 

20 Non-polypeptide moieties contemplated for this aspect of the invention include 

polymer molecules, such as any of the molecules mentioned in the section entitled 
"Conjugation to a polymer molecule", such as PEG or mPEG or mPEG2. The 
conjugation between the lysine-containing polypeptide and the polymer molecule may be 
achieved in any suitable manner, e.g. as described in the section entitled "Conjugation to 

25 a polymer molecule", e.g. in using a one step method or in the stepwise manner referred 
to in said section. An exemplary method for PEGylating the interferon-alpha polypeptide 
is to covalently attach PEG to lysine residues using lysine-reactive PEGs, such as 
described in Example 5 herein. A number of highly specific, lysine-reactive PEGs (such 
as for example, succinimidyl propionate (SPA), succinimidyl butanoate (SBA), N- 

30 hydroxylsuccinimide (NHS), and aldehyde (e.g., ButyrALD)) and different size linear or 
branched PEGs (e.g., 2-40 kDa, such as 2 kDa, 5 kDa, 12 kDa, 15 kDa, 20 kDa, 30 kDa, 
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or 40 kDa) are commercially available, e.g. from Nektar Therapeutics Inc., Huntsville, 
AL, USA, or SunBio, Anyang City, South Korea. 

In another aspect, the invention includes a composition comprising a population 
of conjugates wherein the majority of the conjugates of said population each contain a 
5 single non-polypeptide moiety (such as, a single polymer molecule, e.g., a single PEG, 
such as a linear PEG or a branched PEG ) covalently attached to a single lysine residue or 
N-terminal amino group of the polypeptide. For example, a "monoconjugated" (such as, 
a "monoPEGylated") composition of the invention comprises one or more "positional 
isomers" of said conjugate, wherein each positional isomer contains a single non- 

10 polypeptide moiety (e.g., a single PEG molecule) covalently attached to a single lysine 
residue of the polypeptide. 

The invention includes a monoPEGylated composition comprising a population of 
conjugates, wherein the majority of the conjugates of said population are positional 
isomers each containing a single PEG molecule (such as, a linear or branched PEG, such 

15 as a 2 kDa, 5 kDa, 12 kDa, 15 kDa, 20 kDa, 30 kDa, or 40 kDa mPEG or mPEG2 

molecule) covalently attached to a single lysine residue of a polypeptide of the invention. 

Some such positional isomers contain a single PEG molecule covalently attached to 
a lysine residue selected from K31, K50, K71, K84, K122, K134, K135, and K165. 
Some such positional isomers contain a single PEG molecule covalently attached to a 

20 lysine residue selected from K31, K122, K135, and K165. Some such positional isomers 
contain a single PEG molecule covalently attached to a lysine residue selected from K3 1 , 
K122, and K135, such as, a single PEG molecule covalently attached to a lysine reside 
selected from K31 and K122; K31 and K135; or K122 and K135. 

Example 5 herein describes the preparation of monoPEGylated compositions of the 

25 invention. A monoPEGylated composition of a polypeptide of the invention identified 
herein as 14epil8 (SEQ ID NO: 13) reacted with a 40 kDa branched PEG (mPEG2-NHS) 
. according to the procedure of Example 5 contains predominantly two positional isomers, 
one with a single PEG moiety covalently attached to Lysl22 and another with a single 
PEG moiety covalently attached to Lysl35. Accordingly, the invention includes a 

30 composition (e.g. a monoPEGylated composition), said composition comprising a 

polypeptide conjugate comprising the sequence SEQ ID NO: 13 wherein a PEG moiety is 
covalently attached to Lysl22 or Lysl35. The invention also includes a composition (e.g. 
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a monoPEGylated composition), said composition comprising a polypeptide conjugate 
comprising the sequence SEQ ID NO: 13 wherein a 40 kDa PEG moiety (e.g., a 40 kDa 
branched PEG moiety) is covalently attached to Lysl22 or Lysl35. 

A monoPEGylated composition of another exemplary polypeptide of the invention, 
identified herein as 25epil9 (SEQ ID NO:38), when reacted with a 40 kDa branched PEG 
(mPEG2-NHS) according to the procedure of Example 5 predominantly contains three 
positional isomers, one with a single PEG moiety covalently attached to Lys 31, another 
with a single PEG moiety covalently attached to Lys 122, and another with a single PEG 
moiety covalently attached to Lys 135. Accordingly, the invention includes a composition 
(e.g. a monoPEGylated composition), said composition comprising a polypeptide 
conjugate comprising the sequence SEQ ID NO: 38 wherein a PEG moiety is covalently 
attached to Lys31, Lys 122, or Lys 135. The invention also includes a composition (e.g. a 
monoPEGylated composition), said composition comprising a polypeptide conjugate 
comprising the sequence SEQ ID NO:38 wherein a 40 kDa PEG moiety (e.g., a 40 kDa 
branched PEG moiety) is covalently attached to Lys31, Lys 122, or Lys 135. 

Conjugate of the invention where the non-polypeptide moiety binds to a cysteine residue 

In one aspect, the invention relates to a conjugate exhibiting an interferon-alpha 
activity and comprising at least one non-polypeptide moiety conjugated to at least one 
cysteine residue of an interferon-alpha polypeptide comprising a sequence which (a) 
20 differs in 1 to 16 amino acid positions (such as in 1-15 amino acid positions, in 1-14 
amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1-11 
amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
25 amino acid positions) from SEQ ID NO: 1 and (b) comprises one or more substitutions 
relative to SEQ ID NO:l selected from the group consisting of F48A/L; V51P; F55A; 
F65A; F68P; LI 1 1 A; and VI 14P. Some conjugates according to this aspect comprise at 
least one introduced cysteine residue. 

In another aspect, the invention relates to a conjugate exhibiting an interferon-alpha 
30 activity and comprising at least one non-polypeptide moiety conjugated to at least one 
cysteine residue of an interferon-alpha polypeptide comprising a sequence which (a) 
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differs in 1 to 16 amino acid positions (such as in 1-15 amino acid positions, in 1-14 
amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1-11 
amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 

5 amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions) from SEQ ID NO:36 and (b) comprises one or more substitutions 
relative to SEQ ID NO:36 selected from the group consisting of M21A; I24P; F48A/L; 
T51P; S55A; F65A; F68P; F90A; M93P; LI 1 1 A; VI 14P; F124A; I127P; and E160D. 
Some conjugates according to this aspect comprise at least one introduced cysteine 

10 residue. 

For example, some conjugates of the invention comprise a polypeptide sequence 
further comprising one or more of the following substitutions, relative to SEQ ED NO:l, 
which introduces a cysteine residue into a position which is predicted to be exposed at the 
surface of the molecule with more than a 25% fractional ASA: D2C, L3C, P4C, Q5C, 

15 T6C, H7C, S8C, L9C, G10C, R12C, R13C, M16C, A19C, Q20C, R22C, R23C, I24C, 
S25C, L26C, F27C, S28C, L30C, K31C, R33C, H34C, D35C, R37C, Q40C, E41C, 
E42C, D44C, N46C, H47C, Q49C, K50C, V51C, Q52C, E59C, Q62C, Q63C, N66C, 
S69C, T70C, K71C, D72C, S74C, A75C, D78C, E79C, T80C, L81C, E83C, K84C, I87C, 
F90C, Q91C, N94C, D95C, E97C, A98C, V100C, M101C, Q102C, E103C, V104C, 

20 G105C, E107C, E108C, T109C, PI 10C, L111C, M112C, N113C, V114C, D115C, 
LI 18C, R121C, K122C, Q125C, R126C, T128C, L129C, T132C, E133C, K134C, 
K135C, Y136C, S137C, P138C, A146C, M149C, R150C, S153C, F154C, N157C, 
Q159C, E160C, S161C, L162C, R163C, S164C, K165C and E166C, said amino acid 
residue positions relative to SEQ ID NO:l. 

25 Some such conjugates of the invention comprise a polypeptide sequence further 

comprising one or more of the following substitutions, relative to SEQ ID NO:l, which 
introduces a cysteine residue into a position which is predicted to be exposed at the 
surface of the molecule with more than a 50% fractional ASA: D2C, L3C, P4C, Q5C, 
T6C, H7C, S8C, L9C, R12C, R13C, M16C, A19C, S25C, F27C, S28C, K31C, R33C, 

30 H34C, D35C, R37C, E41C, D44C, N46C, H47C, Q49C, K50C, N66C, K71C, A75C, 
D78C, E79C, T80C, E83C, K84C, I87C, F90C, Q91C, N94C, D95C, M101C, Q102C, 
E103C, G105C, E107C, E108C, T109C, PI 10C, LI 1 1C, VI 14C, Dl 15C, LI 18C, 
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R121C, K122C, Q125C, R126C, L129C, T132C, E133C, K135C, P138C, R150C, 
E160C, L162C, R163C, S164C, K165C and E166C, said amino acid residue positions 
relative to SEQIDNO:l. 

It is to be understood that while the examples of modifications to the parent 
5 polypeptide are generally provided herein relative to the sequence SEQ ID NO:l (or 

relative to some other specified sequence), the disclosed modifications may also be made 
in equivalent amino acid positions of the other polypeptides of the invention (including 
SEQ ID NOs:2-35 and 37-44 and variants thereof) described herein. Thus, as an 
example, the substitution H47C relative to SEQ ID NO: 1 is understood to correspond to 

10 Q47C in SEQ ID NO: 37, and so on. 

In some instances, only a single cysteine residue is introduced in order to avoid 
formation of disulfide bridges between two or more introduced cysteine residues. 

In interferon alphas, disulfide bonds are formed between cysteines at positions 1/99 
and 29/139. The disulfide bond 29/139 is essential for biological activity, while thel/99 

15 bond can be reduced without significantly affecting biological activity (Beilharz M.W. et 
ah (1986) J. Interferon Res. 6(6):677-685). Thus, in another aspect of the invention one 
of CI or C99 is removed, preferably by substitution, e.g. CIS or C99S, thereby leaving 
the other cysteine residue available for conjugation to a non-polypeptide moiety. 
Non-polypeptide moieties contemplated in this aspect of the invention include 

20 polymer molecules, such as any of the molecules mentioned in the section entitled 

"Conjugation to a polymer molecule", such as PEG or mPEG. The conjugation between 
the cysteine-containing polypeptide and the polymer molecule may be achieved in any 
suitable manner, e.g. as described in the section entitled "Conjugation to a polymer 
molecule", e.g. in using a one step method or in the stepwise manner referred to in said 

25 section. An exemplary method for PEGylating the interferon-alpha polypeptide is to 
covalently attach PEG to cysteine residues using cysteine -reactive PEGs. A number of 
highly specific, cysteine-reactive PEGs with different groups (e.g. orthopyridyl-disulfide 
(OPSS), maleimide (MAL) and vinylsulfone (VS)) and different size linear or branched 
PEGs (e.g., 2-40 kDa, such as 2 kDa, 5 kDa, 12 kDa, 15 kDa, 20 kDa, 30 kDa, or 40 kDa) 

30 are commercially available, e.g. from Nektar Therapeutics Inc., Huntsville, AL, USA, or 
SunBio, Anyang City, South Korea. 
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Non-poly peptide moiety of the conjugate of the invention 

As indicated above, the non-polypeptide moiety of the conjugate of the invention is 
generally selected from the group consisting of a polymer molecule, a lipophilic 
compound, a sugar moiety (e.g., by way of in vivo N-glycosylation) and an organic 
5 derivatizing agent. All of these agents may confer desirable properties to the polypeptide 
part of the conjugate, such as reduced immunogenicity, increased functional in vivo half- 
life, increased serum half-life, increased bioavailability and/or increased AUC SC . The 
polypeptide part of the conjugate is often conjugated to only one type of non-polypeptide 
moiety, but may also be conjugated to two or more different types of non-polypeptide 

10 moieties, e.g. to a polymer molecule and a sugar moiety, etc. The conjugation to two or 
more different non-polypeptide moieties may be done simultaneously or sequentially. The 
choice of non-polypeptide moiety/moieties, depends especially on the effect desired to be 
achieved by the conjugation. For instance, sugar moieties have been found particularly 
useful for reducing immunogenicity, whereas polymer molecules such as PEG are of 

1 5 particular use for increasing functional in vivo half-life and/or serum half-life. Using a 
combination of a polymer molecule and a sugar moiety may enhance the reduction in 
immunogenicity and the increase in functional in vivo or serum half-life. 

In the following sections "Conjugation to a lipophilic compound", "Conjugation to a 
polymer molecule", "Conjugation to a sugar moiety" and "Conjugation to an organic 

20 derivatizing agent" conjugation to specific types of non-polypeptide moieties is 
described. 

Conjugation to a lipophilic compound 

For conjugation to a lipophilic compound the following polypeptide groups may 
function as attachment groups: the N-terminus or C -terminus of the polypeptide, the 

25 hydroxy groups of the amino acid residues Ser, Thr or Tyr, the e-amino group of Lys, the 
SH group of Cys or the carboxyl group of Asp and Glu. The polypeptide and the 
lipophilic compound may be conjugated to each other either directly or by use of a linker. 
The lipophilic compound may be a natural compound such as a saturated or unsaturated 
fatty acid, a fatty acid diketone, a terpene, a prostaglandin, a vitamin, a carotenoid or 

30 steroid, or a synthetic compound such as a carbon acid, an alcohol, an amine and 
sulphonic acid with one or more alkyl, aryl, alkenyl or other multiple unsaturated 
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compounds. The conjugation between the polypeptide and the lipophilic compound, 
optionally through a linker may be done according to methods known in the art, e.g. as 
described by Bodanszky in Peptide Synthesis, John Wiley, New York, 1976 and in WO 
96/12505. 

5 Conjugation to a polymer molecule 

The polymer molecule to be coupled to the polypeptide may be any suitable polymer 
molecule, such as a natural or synthetic homo-polymer or heteropolymer, typically with a 
molecular weight in the range of about 300-100,000 Da, such as about 1000-50,000 Da, 
e.g. in the range of about 1000- 40,000 Da. More particularly, the polymer molecule, 

10 such as PEG, in particular mPEG, will typically have a molecular weight of about 2, 5, 
10, 12, 15, 20, 30, 40 or 50 kDa, in particular a molecular weight of about 5 kDa, about 
10 kDa, about 12 kDa, about 15 kDa, about 20 kDa, about 30 kDa or about 40 kDa. The 
PEG molecule may be branched (e.g., mPEG2), or may be unbranched (i.e., linear). 
When used about polymer molecules herein, the word "about" indicates an 

15 approximate average molecular weight and reflects the fact that there will normally be a 
certain molecular weight distribution in a given polymer preparation. 

Examples of homo-polymers include a polyol (i.e. poly-OH), a polyamine (i.e. poly- 
NH 2 ) and a polycarboxylic acid (i.e. poly-COOH). A hetero- polymer is a polymer which 
comprises one or more different coupling groups, such as a hydroxyl group and an amine 

20 group. 

Examples of suitable polymer molecules include polymer molecules selected from 
the group consisting of polyalkylene oxide (PAO), including polyalkylene glycol (PAG), 
such as polyethylene glycol (PEG) and polypropylene glycol (PPG), branched PEGs 
(PEG2), poly- vinyl alcohol (PVA), poly-carboxylate, poly-(vinylpyrolidone), 

25 polyethylene-co-maleic acid anhydride, polystyrene-co-malic acid anhydride, dextran 
including carboxymethyl-dextran, or any other biopolymer suitable for reducing 
immunogenicity and/or increasing functional in vivo half-life and/or serum half-life. 
Generally, polyalkylene glycol-derived polymers are biocompatible, non-toxic, non- 
antigenic, non-immunogenic, have various water solubility properties, and are easily 

30 excreted from living organisms. 
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PEG is the preferred polymer molecule to be used, since it has only few reactive 
groups capable of cross-linking compared to e.g. polysaccharides such as dextran. In 
particular, monofunctional PEG, e.g. monomethoxypolyethylene glycol (mPEG), is of 
interest since its coupling chemistry is relatively simple (only one reactive group is 
5 available for conjugating with attachment groups on the polypeptide). Consequently, the 
risk of cross-linking is eliminated, the resulting polypeptide conjugates are more 
homogeneous and the reaction of the polymer molecules with the polypeptide is easier to 
control. 

To effect covalent attachment of the polymer molecule(s) to the polypeptide, the 

10 hydroxyl end groups of the polymer molecule must be provided in activated form, i.e. 
with reactive functional groups (examples of which include primary amino groups, 
hydrazide (HZ), thiol, succinate (SUC), succinimidyl succinate (SS), succinimidyl 
succinamide (SSA), succinimidyl propionate (SPA), succinimidyl butanoate (SBA), 
succinimidyl carboxymethylate (SCM), benzotriazole carbonate (BTC), N- 

15 hydroxysuccinimide (NHS), aldehyde, nitrophenylcarbonate (NPC), and tresylate 

(TRES)). Suitably activated polymer molecules are commercially available, e.g. from 
Nektar Therapeutics, Inc., Huntsville, AL, USA; PolyMASC Pharmaceuticals pic, UK; or 
SunBio Corporation, Anyang City, South Korea. Alternatively, the polymer molecules 
can be activated by conventional methods known in the art, e.g. as disclosed in WO 

20 90/13540. Specific examples of activated linear or branched polymer molecules suitable 
for use in the present invention are described in the Nektar Therapeutics, Inc. 2003 
Catalog ("Nektar Molecule Engineering: Polyethylene Glycol and Derivatives for 
Advanced Pegylation, Catalog 2003")> incorporated by reference herein. Specific 
examples of activated PEG polymers include the following linear PEGs: NHS-PEG , 

25 SPA-PEG, SSPA-PEG, SBA-PEG, SS-PEG, SSA-PEG, SC-PEG, SG-PEG, SCM-PEG, 
NOR-PEG, BTC-PEG, EPOX-PEG, NCO-PEG, NPC-PEG, CDI-PEG, ALD-PEG, 
TRES-PEG, VS-PEG, OPSS-PEG, IODO-PEG, and MAL-PEG, and branched PEGs, 
such as PEG2-NHS, PEG2-MAL, and those disclosed in US 5,932,462 and US 
5,643,575, both of which are incorporated herein by reference. Furthermore, the 

30 following publications, incorporated herein by reference, disclose useful polymer 

molecules and/or PEGylation chemistries: US 5,824,778, US 5,476,653, WO 97/32607, 
EP 229,108, EP 402,378, US 4,902,502, US 5,281,698, US 5,122,614, US 5,219,564, WO 
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92/16555, WO 94/04193, WO 94/14758, WO 94/17039, WO 94/18247, WO 94/28024, 
WO 95/00162, WO 95/1 1924, WO95/13090, WO 95/33490, WO 96/00080, WO 
97/18832, WO 98/41562, WO 98/48837, WO 99/32134, WO 99/32139, WO 99/32140, 
WO 96/40791, WO 98/32466, WO 95/06058, EP 439 508, WO 97/03106, WO 96/21469, 
WO 95/13312, EP 921 131, US 5,736,625, WO 98/05363, EP 809 996, US 5,629,384, 
WO 96/41813, WO 96/07670, US 5,473,034, US 5,516,673, EP 605 963, US 5,382,657, 
EP 510 356, EP 400 472, EP 183 503 and EP 154 316. 

The conjugation of the polypeptide and the activated polymer molecules is 
conducted by use of any conventional method, e.g. as described in the following 
references (which also describe suitable methods for activation of polymer molecules): 
Harris and Zalipsky, eds., Poly(ethylene glycol) Chemistry and Biological Applications, 
AZC, Washington; R.F. Taylor, (1991), "Protein immobilisation. Fundamental and 
applications", Marcel Dekker, N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjuga- 
tion and Crosslinking", CRC Press, Boca Raton; G.T. Hermanson et al., (1993), 
"Immobilized Affinity Ligand Techniques", Academic Press, N.Y. 

For PEGylation of cysteine residues the polypeptide is usually treated with a 
reducing agent, such as dithiothreitol (DDT) prior to PEGylation. The reducing agent is 
subsequently removed by any conventional method, such as by desalting. Conjugation of 
PEG to a cysteine residue typically takes place in a suitable buffer at pH 6-9 at 
temperatures varying from 4°C to 25°C for periods up to about 16 hours. Examples of 
activated PEG polymers for coupling to cysteine residues include the following linear and 
branched PEGs: vinylsulfone-PEG (PEG-VS), such as vinylsulfone-mPEG (mPEG-VS); 
orthopyridyl-disulfide-PEG (PEG-OPSS), such as orthopyridyl-disulfide-mPEG (mPEG- 
OPSS); and maleimide-PEG (PEG-MAL), such as maleimide-mPEG (mPEG-MAL) and 
branched maleimide-mPEG2 (mPEG2-MAL). 

Pegylation of lysines often employs PEG-N-hydroxylsuccinimide (e.g., mPEG-NHS 
or mPEG2-NHS), or esters such as PEG succinimidyl propionate (e.g., mPEG-SPA) or 
PEG succinimidyl butanoate (e.g., mPEG-SBA). One or more PEGs can be attached to a 
protein within 30 minutes at pH 8-9.5 at room temperature if about equimolar amounts of 
PEG and protein are mixed. A molar ratio of PEG to protein amino groups of 1-5 to 1 
will usually suffice. Increasing pH increases the rate of reaction, while lowering pH 
reduces the rate of reaction. These highly reactive active esters can couple at 
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physiological pH, but less reactive derivatives typically require higher pH. Low 
temperatures may also be employed if a labile protein is being used. Under low 
temperature conditions, a longer reaction time may be used. 

N-terminal PEGylation is facilitated by the difference between the pKa values of 
5 the a-amino group of the N-terminal amino acid (-7.6 to 8.0) and the e-amino group of 
lysine (-10). PEGylation of the N-terminal amino group often employs PEG-aldehydes 
(such as mPEG-propionaldehyde or mPEG-butylaldehyde), which are more selective for 
amines and thus are less likely to react with the imidazole group of histidine; in addition, 
PEG reagents used for lysine conjugation (such as mPEG-SPA, mPEG-SBA, or mPEG- 

10 NHS) may also be used for conjugation of the N-terminal amine. Conjugation of a PEG- 
aldehyde to the N-terminal amino group typically takes place in a suitable buffer (such as, 
100 mM sodium acetate or 100 mM sodium bisphosphate buffer with 20 mM sodium 
cyanoborohydride) at pH - 5.0 overnight at temperatures varying from about 4°C to 
25°C. Useful N-terminal PEGylation methods and chemistries are also described in US 

15 Pat. 5,985,265 and US Pat. 6,077,939, both incorporated herein by reference. 

Typically, linear PEG or mPEG polymers will have a molecular weight of about 5 
kDa, about 10 kDa, about 12 kDa, about 15 kDa, about 20 kDa, or about 30 kDa. 
Branched PEG (PEG2 or mPEG2) polymers will typically have a molecular weight of 
about 10 kDa, about 20 kDa, or about 40 kDa. In some instances, the higher-molecular 

20 weight branched PEG2 reagents, such as 20 kDa or 40 kDa PEG2, including e.g. mPEG2- 
NHS for lysine PEGylation, mPEG2-MAL for cysteine PEGylation, or MPEG2-aldehyde 
for N-terminal PEGylation (all available from Nektar Therapeutics, Inc, Huntsville AL), 
may be used. The branched structure of the PEG2 compound results in a relatively large 
molecular volume, so fewer attached molecules (or, one attached molecule) may impart 

25 the desired characteristics of the PEGylated molecule. 

The skilled person will be aware that the activation method and/or conjugation 
chemistry to be used depends on the attachment group(s) of the interferon-alpha 
polypeptide as well as the functional groups of the polymer (e.g., being amino, hydroxyl, 
carboxyl, aldehyde or sulfhydryl). The PEGylation may be directed towards conjugation 

30 to all available attachment groups on the polypeptide (i.e. such attachment groups that are 
exposed at the surface of the polypeptide) or may be directed towards specific attachment 
groups, e.g. cysteine residues, lysine residues, or the N-terminal amino group. 
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Furthermore, the conjugation may be achieved in one step or in a stepwise manner (e.g. as 
described in WO 99/55377). 

In some instances, the polymer conjugation is performed under conditions aiming at 
reacting as many of the available polymer attachment groups as possible with polymer 
5 molecules. This is achieved by means of a suitable molar excess of the polymer in 
relation to the polypeptide. Typical molar ratios of activated polymer molecules to 
polypeptide are up to about 1000-1, such as up to about 200-1 or up to about 100-1. In 
some cases, the ratio may be somewhat lower, however, such as up to about 50-1, 10-1 or 
5-1. Also equimolar ratios may be used, 
10 It is also contemplated according to the invention to couple the polymer molecules to 

the polypeptide through a linker. Suitable linkers are well known to the skilled person. A 
preferred example is cyanuric chloride (Abuchowski et al., (1977), J. Biol. Chem., 252, 
3578-3581; US 4,179,337; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
375-378). 

15 Subsequent to the conjugation residual activated polymer molecules are blocked 

according to methods known in the art, e.g. by addition of primary amine to the reaction 
mixture, and the resulting inactivated polymer molecules removed by a suitable method. 

Covalent in vitro coupling of a sugar moiety to amino acid residues of interferon- 
alpha may be used to modify or increase the number or profile of sugar substituents. 

20 Depending on the coupling mode used, the carbohydrate(s) may be attached to a) arginine 
and histidine (Lundblad and Noyes, Chemical Reagents for Protein Modification, CRC 
Press Inc. Boca Raton, FI), b) free carboxyl groups (e.g. of the C-terminal amino acid 
residue, asparagine or glutamine), c) free sulfhydryl groups such as that of cysteine, d) 
free hydroxyl groups such as those of serine, threonine, tyrosine or hydroxyproline, e) 

25 aromatic residues such as those of phenylalanine or tryptophan or f) the amide group of 
glutamine. These amino acid residues constitute examples of attachment groups for a 
sugar moiety, which may be introduced and/or removed in the interferon-alpha 
polypeptide. Suitable methods of in vitro coupling are described in WO 87/05330 and in 
Aplin et al., CRC Crit Rev. Biochem., pp. 259-306, 1981. The in vitro coupling of sugar 

30 moieties or PEG to protein- and peptide-bound Gin-residues can also be carried out by 
transglutaminases (TGases), e.g. as described by Sato et al., 1996 Biochemistry 35, 
13072-13080 or in EP 725145. 
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Coupling to a sugar moiety 

In order to achieve in vivo glycosylation of an interferon-alpha polypeptide that has 
been modified by introduction of one or more glycosylation sites, the nucleotide sequence 
encoding the polypeptide part of the conjugate is inserted in a glycosylating, eukaryotic 

5 expression host. The expression host cell may be selected from fungal (filamentous 
fungal or yeast), insect, mammalian animal cells, from transgenic plant cells or from 
transgenic animals. Furthermore, the glycosylation may be achieved in the human body 
when using a nucleotide sequence encoding the polypeptide part of a conjugate of the 
invention or a polypeptide of the invention in gene therapy. In one aspect the host cell is a 

10 mammalian cell, such as a CHO cell, a COS cell, a BHK or HEK cell, e.g. HEK293, or an 
insect cell, such as an SF9 cell, or a yeast cell, e.g. Saccharomyces cerevisiae, Pichia 
pastoris or any other suitable glycosylating host, e.g. as described further below. 
Optionally, sugar moieties attached to the interferon-a polypeptide by in vivo 
glycosylation are further modified by use of glycosyltransferases, e.g. using the 

15 GlycoAdvance™ technology marketed by Neose, Horsham, PA, USA. Thereby, it is 
possible to, e.g., increase the sialyation of the glycosylated interferon-alpha polypeptide 
following expression and in vivo glycosylation by CHO cells. 

Coupling to an organic derivatizing agent 

Covalent modification of the interferon-alpha polypeptide may be performed by 
20 reacting (an) attachment group(s) of the polypeptide with an organic derivatizing agent. 
Suitable derivatizing agents and methods are well known in the art. For example, 
cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding 
amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl or 
carboxyamidomethyl derivatives. Cysteinyl residues also are derivatized by reaction with 
25 bromotrifluoroacetone, a-bromo-f}-(4-imidozoyl)propionic acid, chloroacetyl phosphate, 
N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p- 
chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3- 
diazole. Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5- 
7.0 because this agent is relatively specific for the histidyl side chain. Para- 
30 bromophenacyl bromide is also useful; the reaction is preferably performed in 0.1 M 
sodium cacodylate at pH 6.0. Lysinyl and amino terminal residues are reacted with 
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succinic or other carboxylic acid anhydrides. Derivatizatibn with these agents has the 
effect of reversing the charge of the lysinyl residues. Other suitable reagents for 
derivatizing oc-amino-containing residues include imidoesters such as methyl 
picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; 

5 trinitrobenzenesulfonic acid; O-methylisourea; 2,4-pentanedione; and transaminase- 

catalyzed reaction with glyoxylate. Arginyl residues are modified by reaction with one or 
several conventional reagents, among them phenylglyoxal, 2,3-butanedione, 1,2- 
cyclohexanedione, and ninhydrin. Derivatization of arginine residues requires that the 
reaction be performed in alkaline conditions because of the high pKa of the guanidine 

10 functional group. Furthermore, these reagents may react with the groups of lysine as well 
as the arginine guanidino group. Carboxyl side groups (aspartyl or glutamyl or C-terminal 
amino acid residue) are selectively modified by reaction with carbodiimides (R-N=C=N- 
R'), where R and R' are different alkyl groups, such as l-cyclohexyl-3-(2-morpholinyl-4- 
ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. 

15 Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl 
residues by reaction with ammonium ions. 

Blocking of a functional site 

Since excessive polymer conjugation may lead to a loss of activity of the interferon- 
a polypeptide to which the polymer is conjugated, it may be advantageous to remove 

20 attachment groups located at the functional site or to block the functional site prior to 
conjugation. These latter strategies constitute further aspects of the invention (the first 
strategy being exemplified further above, e.g. by removal of lysine residues which may be 
located close to a functional site). More specifically, according to the second strategy the 
conjugation between the interferon-alpha polypeptide and the non-polypeptide moiety is 

25 conducted under conditions where the functional site of the polypeptide is blocked by a 
helper molecule capable of binding to the functional site of the polypeptide. Preferably, 
the helper molecule is one which specifically recognizes a functional site of the 
polypeptide, such as a receptor, in particular the type I interferon receptor. Alternatively, 
the helper molecule may be an antibody, in particular a monoclonal antibody recognizing 

30 the interferon-alpha polypeptide. In particular, the helper molecule may be a neutralizing 
monoclonal antibody. 
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The polypeptide is allowed to interact with the helper molecule before effecting 
conjugation. This ensures that the functional site of the polypeptide is shielded or 
protected and consequently unavailable for derivatization by the non-polypeptide moiety 
such as a polymer. Following its elution from the helper molecule, the conjugate between 
the non-polypeptide moiety and the polypeptide can be recovered with at least a partially 
preserved functional site. 

The subsequent conjugation of the polypeptide having a blocked functional site to a 
polymer, a lipophilic compound, an organic derivatizing agent or any other compound is 
conducted in the normal way, e.g. as described in the sections above entitled 
"Conjugation to 

Irrespective of the nature of the helper molecule to be used to shield the functional 
site of the polypeptide from conjugation, it is desirable that the helper molecule is free 
from or comprises only a few attachment groups for the non-polypeptide moiety of choice 
in parts of the molecule where the conjugation to such groups would hamper the 
desorption of the conjugated polypeptide from the helper molecule. Hereby, selective 
conjugation to attachment groups present in non-shielded parts of the polypeptide can be 
obtained and it is possible to reuse the helper molecule for repeated cycles of conjugation. 
For instance, if the non-polypeptide moiety is a polymer molecule such as PEG, which 
has the epsilon amino group of a lysine or N-terminal amino acid residue as an 
attachment group, it is desirable that the helper molecule is substantially free from 
conjugatable epsilon amino groups, preferably free from any epsilon amino groups. 
Accordingly, in some instances the helper molecule is a protein or peptide capable of 
binding to the functional site of the polypeptide, which protein or peptide is free from any 
conjugatable attachment groups for the non-polypeptide moiety of choice. 

In a further aspect the helper molecule is first covalently linked to a solid phase such 
as column packing materials, for instance Sephadex or agarose beads, or a surface, e.g. 
reaction vessel. Subsequently, the polypeptide is loaded onto the column material 
carrying the helper molecule and conjugation carried out according to methods known in 
the art, e.g; as described in the sections above entitled "Conjugation to This 
procedure allows the polypeptide conjugate to be separated from the helper molecule by 
elution. The polypeptide conjugate is eluted by conventional techniques under physico- 
chemical conditions that do not lead to a substantive degradation of the polypeptide 



87 



WO 2005/113592 



PCT/US2005/017471 



conjugate. The fluid phase containing the polypeptide conjugate is separated from the 
solid phase to which the helper molecule remains covalently linked. The separation can 
be achieved in other ways: For instance, the helper molecule may be derivatized with a 
second molecule (e.g. biotin) that can be recognized by a specific binder (e.g. 
strep tavidin). The specific binder may be linked to a solid phase thereby allowing the 
separation of the polypeptide conjugate from the helper molecule-second molecule 
complex through passage over a second helper-solid phase column which will retain, 
upon subsequent elution, the helper molecule-second molecule complex, but not the 
polypeptide conjugate. The polypeptide conjugate may be released from the helper 
molecule in any appropriate fashion. De-protection may be achieved by providing 
conditions in which the helper molecule dissociates from the functional site of the 
interferon-oc to which it is bound. For instance, a complex between an antibody to which a 
polymer is conjugated and an anti-idiotypic antibody can be dissociated by adjusting the 
pH to an acid or alkaline pH. 

Conjugation of a tagged interferon-alpha polypeptide 

In another aspect the interferon-alpha polypeptide is expressed as a fusion protein 
with a tag, i.e. an amino acid sequence or peptide made up of typically 1-30, such as 1-20 
or 1-15 or 1-10 or 1-5 amino acid residues, e.g. added to the N-terminus or to the C- 
terminus of the polypeptide. Besides allowing for fast and easy purification, the tag is a 
convenient tool for achieving conjugation between the tagged polypeptide and the non- 
polypeptide moiety. In particular, the tag may be used for achieving conjugation in 
microtiter plates or other carriers, such as paramagnetic beads, to which the tagged 
polypeptide can be immobilised via the tag. The conjugation to the tagged polypeptide in, 
e.g., microtiter plates has the advantage that the tagged polypeptide can be immobilised in 
the microtiter plates directly from the culture broth (in principle without any purification) 
and subjected to conjugation. Thereby, the total number of process steps (from expression 
to conjugation) can be reduced. Furthermore, the tag may function as a spacer molecule 
ensuring an improved accessibility to the immobilised polypeptide to be conjugated. The 
conjugation using a tagged polypeptide may be to any of the non-polypeptide moieties 
disclosed herein, e.g. to a polymer molecule such as PEG. 
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The identity of the specific tag to be used is not critical as long as the tag is capable 
of being expressed with the polypeptide and is capable of being immobilised on a suitable 
surface or carrier material. A number of suitable tags are commercially available, e.g. 
from Unizyme Laboratories, Denmark. Antibodies against such tags are commercially 
5 available, e.g. from ADI, Aves Lab and Research Diagnostics. 

POLYNUCLEOTIDES OF THE INVENTION 

The invention provides isolated or recombinant nucleic acids (also referred to herein 
as polynucleotides), collectively referred to as "nucleic acids (or polynucleotides) of the 
invention", which encode polypeptides of the invention. The polynucleotides of the 

10 invention are useful in a variety of applications. As discussed above, the polynucleotides 
are useful in producing polypeptides of the invention. In addition, polynucleotides of the 
invention can be incorporated into expression vectors useful for gene therapy, DNA 
vaccination, and immunotherapy, as described in more detail below. 

In one aspect, the invention provides isolated or recombinant polynucleotides that 

15 each comprise a nucleic acid sequence which encodes a polypeptide comprising an amino 
acid sequence selected from SEQ ID NOs:2-35 and 37-44; or, a complementary nucleic 
acid sequence thereof. Exemplary nucleic acid sequences according to this aspect of the 
invention include, but are not limited to, SEQ ID NOs:59-61, which each encode a 
polypeptide comprising the sequence SEQ ID NO: 10; SEQ ID NOs:62-64 and 89, which 

20 each encode a polypeptide comprising the sequence SEQ ID NO: 13; SEQ ID NOs: 65-67, 
which each encode a polypeptide comprising the sequence SEQ ED NO: 15; SEQ ID 
NOs:68-70, which each encode a polypeptide comprising the sequence SEQ ID NO:23; 
SEQ ID NOs:71-73, which each encode a polypeptide comprising the sequence SEQ ID 
NO:27; SEQ ID NOs:74-76, which each encode a polypeptide comprising the sequence 

25 SEQ ID NO: 30; SEQ ID NOs: 77-79, which each encode a polypeptide comprising the 
sequence SEQ ID NO:37; SEQ ID NOs:80-82 and 90, which each encode a polypeptide 
comprising the sequence SEQ ID NO:38; SEQ ID NOs:83-85, which each encode a 
polypeptide comprising the sequence SEQ ID NO:41; and SEQ ID NOs:86-88, which 
each encode a polypeptide comprising the sequence SEQ ID NO:44. 

30 The invention also provides isolated or recombinant polynucleotides that each 

comprise a nucleic acid sequence which encodes a polypeptide comprising an amino acid 
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sequence which (a) differs in 1 to 16 amino acid positions (such as in 1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 
1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 
11 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
5 amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 
amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions) from SEQ ID NO: 1 and (b) comprises one or more substitutions 
relative to SEQ ID NO:l selected from the group consisting of F48A/L, V51P, F55A, 
F65A, F68P, LI 1 1 A, and VI 14P; or, a complementary nucleic acid sequence thereof. 

10 Some polypeptides encoded by polynucleotides of the invention further comprise one or 
more additional amino acid(s), such as a methionine added to the N-terminus. In some 
instances the encoded polypeptide exhibits an interferon-alpha activity. 

The invention also provides isolated or recombinant polynucleotides that each 
comprise a nucleic acid sequence which encodes a polypeptide comprising an amino acid 

15 sequence which (a) differs in 1 to 16 amino acid positions (such as in 1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, or 16 amino acid positions, e.g., in 1-15 amino acid positions, in 
1-14 amino acid positions, in 1-13 amino acid positions, in 1-12 amino acid positions, 1- 
1 1 amino acid positions, in 1-10 amino acid positions, in 1-9 amino acid positions, in 1-8 
amino acid positions, in 1-7 amino acid positions, in 1-6 amino acid positions, in 1-5 

20 amino acid positions, in 1-4 amino acid positions, in 1-3 amino acid positions, or in 1-2 
amino acid positions) from SEQ ID NO: 36 and (b) comprises one or more substitutions 
relative to SEQ ID NO:36 selected from the group consisting of M21A, I24P, F48A/L, 
T51P, S55A, F65A, F68P, F90A, M93P, L111A, V114P, F124A, I127P, and E160D; or, 
a complementary nucleic acid sequence thereof. Some polypeptides encoded by 

25 polynucleotides of the invention further comprise one or more additional amino acid(s), 
such as a methionine added to the N-terminus. In some instances the encoded 
polypeptide exhibits an interferon-alpha activity. 

The invention also provides isolated or recombinant polynucleotides that each 
comprise a nucleic acid sequence which encodes a polypeptide comprising an amino acid 

30 sequence having at least 90% sequence identity (e.g., at least about 91%, at least about 
92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at 
least about 97%, at least about 98%, or at least about 99% amino acid sequence identity) 
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to any one of SEQ ID NOs:l-35, wherein the amino acid sequence comprises one or more 
of: Ala or Leu at position 48; Pro at position 51; Ala at position 55; Ala at position 65; 
Pro at position 68; Ala at position 111; and Pro at position 1 14, position numbering 
relative to SEQ ID NO: 1; or, a complementary nucleic acid sequence thereof. Some 

5 polypeptides encoded by polynucleotides of the invention further comprise one or more 
additional amino acid(s), such as a methionine added to the N-terminus. In some 
instances the encoded polypeptide exhibits an interferon-alpha activity. 

The invention also provides isolated or recombinant polynucleotides that each 
comprise a nucleic acid sequence which encodes a polypeptide comprising an amino acid 

10 sequence having at least 90% sequence identity (e.g., at least about 91%, at least about 
92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at 
least about 97%, at least about 98%, or at least about 99% amino acid sequence identity) 
to any one of SEQ ID NOs: 36-44, wherein the amino acid sequence comprises one or 
more of: Ala at position 21; Pro at position 24; Ala or Leu at position 48; Pro at position 

15 51; Ala at position 55; Ala at position 65; Pro at position 68; Ala at position 90; Pro at 
position 93; Ala at position 111; Pro at position 1 14; Ala at position 124; Pro at position 
127; and Asp at position 160, position numbering relative to SEQ ID NO: 36; or, a 
complementary nucleic acid sequence thereof. Some polypeptides encoded by 
polynucleotides of the invention further comprise one or more additional amino acid(s), 

20 such as a methionine added to the N-terminus. In some instances the encoded polypeptide 
exhibits an interferon-alpha activity. 

Additional Aspects 

Any of the polynucleotides of the invention (which includes those described above) 
may encode a fusion protein comprising at least one additional amino acid sequence, such 

25 as, for example, a secretion/localization sequence, a sequence useful for solubilization or 
immobilization (e.g., for cell surface display) of the polypeptide, a sequence useful for 
detection and/or purification of the polypeptide (e.g., a polypeptide purification 
subsequence, such as an epitope tag, a polyhistidine sequence, and the like). 
In another aspect, the invention provides cells comprising one or more of the 

30 polynucleotides of the invention. Such cells may express one or more polypeptides 
encoded by the polynucleotides of the invention. 
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The invention also provides vectors comprising any of the polynucleotides of the 
invention. Such vectors may comprise a plasmid, a cosmid, a phage, a virus, or a 
fragment of a virus. Such vectors may comprise an expression vector, and, if desired, the 
nucleic acid is operably linked to a promoter, including those discussed herein and below. 

5 Furthermore, in another aspect, the invention provides compositions comprising an 
excipient or carrier and at least one of any of the polynucleotides of the invention, or 
vectors, cells, or host comprising such nucleic acids. Such composition may be 
pharmaceutical compositions, and the excipient or carrier may be a pharmaceutically 
acceptable excipient or carrier. 

10 The invention also includes compositions comprising two or more nucleic acids of 

the invention, or fragments thereof (e.g., as substrates for recombination). The 
composition can comprise a library of recombinant nucleic acids, where the library 
contains at least 2, at least 3, at least 5, at least 10, at least 20, at least 50, or at least 100 
or more nucleic acids described above. The nucleic acids are optionally cloned into 

15 expression vectors, providing expression libraries. 

The polynucleotides of the invention and fragments thereof, as well as vectors 
comprising such polynucleotides, may be employed for therapeutic or prophylactic uses 
in combination with a suitable carrier, such as a pharmaceutical carrier. Such 
compositions comprise a therapeutically and/or prophylactically effective amount of the 

20 compound, and a pharmaceutically acceptable carrier or excipient. Such a carrier or 

excipient includes, but is not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol, and combinations thereof. The formulation should suit the mode of 
administration. Methods of administering nucleic acids, polypeptides, and proteins are 
well known in the art, and are further discussed below. 

25 The invention also includes compositions produced by digesting one or more of any 

of the nucleic acids of the invention with a restriction endonuclease, an RNAse, or a 
DNAse (e.g., as is performed in certain of the recombination formats noted above); and 
compositions produced by fragmenting or shearing one or more nucleic acids of the 
invention by mechanical means (e.g., sonication, vortexing, and the like), which can also 

30 be used to provide substrates for recombination in the methods described herein. The 
invention also provides compositions produced by cleaving at least one of any of the 
nucleic acids of the invention. The cleaving may comprise mechanical, chemical, or 
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enzymatic cleavage, and the enzymatic cleavage may comprise cleavage with a restriction 
endonuclease, an RNAse, or a DNAse. 

Also included in the invention are compositions produced by a process comprising 
incubating one or more of the fragmented nucleic acids of the invention in the presence of 
ribonucleotide or deoxyribonucleotide triphosphates and a nucleic acid polymerase. This 
resulting composition forms a recombination mixture for many of the recombination 
formats noted above. The nucleic acid polymerase may be an RNA polymerase, a DNA 
polymerase, or an RNA-directed DNA polymerase (e.g., a "reverse transcriptase"); the 
polymerase can be, e.g., a thermostable DNA polymerase (e.g., VENT, TAQ, or the like). 

Similarly, compositions comprising sets of oligonucleotides corresponding to more 
than one nucleic acids of the invention are useful as recombination substrates and are a 
feature of the invention. For convenience, these fragmented, sheared, or oligonucleotide 
synthesized mixtures are referred to as fragmented nucleic acid sets. 

The invention also provides an isolated or recombinant nucleic acid encoding a 
polypeptide that exhibits an interferon-alpha activity, produced by mutating or 
recombining at least one nucleic acid of the invention. 

Making Polynucleotides 

Polynucleotides, oligonucleotides, and nucleic acid fragments of the invention can 
be prepared by standard solid-phase methods, according to known synthetic methods. 
20 Typically, fragments of up to about 100 bases are individually synthesized, then joined 
(e.g., by enzymatic or chemical ligation methods, or polymerase mediated recombination 
methods) to form essentially any desired continuous sequence. For example, the 
polynucleotides and oligonucleotides of the invention can be prepared by chemical 
synthesis using, e.g., classical phosphoramidite method described by, e.g., Beaucage et al. 
25 (1981) Tetrahedron Letters 22: 1859-69, or the method described by Matthes et al. (1984) 
EMBO J 3:801-05, e.g., as is typically practiced in automated synthetic methods. 
According to the phosphoramidite method, oligonucleotides are synthesized, e.g., in an 
automatic DNA synthesizer, purified, annealed, ligated and cloned into appropriate 
vectors. 

30 In addition, essentially any polynucleotide can be custom ordered from any of a 

variety of commercial sources, such as Operon Technologies Inc. (Alameda, CA) and 

93 



10 



WO 2005/113592 



PCT/US2005/0 17471 



many others. Similarly, peptides and antibodies can be custom ordered from any of a 
variety of sources, e.g., Celtek Peptides (Nashville, TN); Washington Biotechnology, Inc. 
(Baltimore MD); Global Peptide Services (Ft. Collin CO), and many others. 

Certain polynucleotides of the invention may also be obtained by screening cDNA 
libraries (e.g., libraries generated by recombining homologous nucleic acids as in typical 
recursive sequence recombination methods) using oligonucleotide probes that can 
hybridize to or PCR-amplify polynucleotides which encode interferon-alpha polypeptides 
and fragments of those polypeptides. Procedures for screening and isolating cDNA 
clones are well-known to those of skill in the art. Such techniques are described in, e.g., 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymol. Vol. 
152, Acad. Press, Inc., San Diego, CA ("Berger"); Sambrook, supra, and Current 
Protocols in Molecular Biology, Ausubel, supra. Some polynucleotides of the invention 
can be obtained by altering a naturally occurring sequence, e.g., by mutagenesis, 
recursive sequence recombination (e.g., shuffling), or oligonucleotide recombination. In 
other cases, such polynucleotides can be made in silico or through oligonucleotide 
recombination methods as described in the references cited herein. 

As described in more detail herein, the polynucleotides of the invention include 
polynucleotides that encode polypeptides of the invention, polynucleotide sequences 
complementary to these polynucleotide sequences, and polynucleotides that hybridize 
under at least stringent conditions to the sequences defined herein. A coding sequence 
refers to a polynucleotide sequence encoding a particular polypeptide or domain, region, 
or fragment of said polypeptide. A coding sequence may encode (code for) a polypeptide 
of the invention exhibiting an interferon alpha activity as described above. The 
polynucleotides of the invention may be in the form of RNA or in the form of DNA, and 
include mRNA, cRNA, synthetic RNA and DNA, and cDNA. The polynucleotides may 
be double-stranded or single-stranded, and if single-stranded, can be the coding strand or 
the non-coding (anti-sense, complementary) strand. The polynucleotides of the invention 
include the coding sequence of a polypeptide of the invention (i) in isolation, (ii) in 
combination with one or more additional coding sequences, so as to encode, e.g., a fusion 
protein, a pre-protein, a prepro-protein, or the like, (iii) in combination with non-coding 
sequences, such as introns, control elements, such as a promoter (e.g., naturally occurring 
or recombinant or shuffled promoter), a terminator element, or 5' and/or 3' untranslated 
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regions effective for expression of the coding sequence in a suitable host, and/or (iv) in a 
vector, cell, or host environment in which the coding sequence is a heterologous gene. 

Polynucleotides of the invention can also be found in combination with typical 
compositional formulations of nucleic acids, including in the presence of carriers, buffers, 

5 adjuvants, excipients, and the like, as are known to those of ordinary skill in the art. 

Polynucleotide fragments typically comprise at least about 200 nucleotide bases, such as 
at least about 250, 300, 350, 400, 450, 460, 470, or more bases. The nucleotide fragments 
of polynucleotides of the invention may hybridize under highly stringent conditions to a 
polynucleotide sequence described herein and/or encode amino acid sequences having at 

10 least one of the properties of polypeptides of the invention described herein. 

Modified Coding Sequences 

As will be understood by those of ordinary skill in the art, it can be advantageous to 
modify a coding sequence to enhance its expression in a particular host. The genetic code 
is redundant with 64 possible codons, but most organisms preferentially use a subset of 
15 these codons. The codons that are utilized most often in a species are considered optimal 
codons, and those not utilized very often are classified as rare or low-usage codons (see, 
e.g., Zhang, S. P. et al. (1991) Gene 105:61-72). Codons can be substituted to reflect the 
preferred codon usage of the host, a process sometimes termed "codon optimization" or 
"controlling for species codon bias." 
20 Modified coding sequence containing codons preferred by a particular prokaryotic or 

eukaryotic host (see, e.g., Murray, E. et al. (1989) Nuc Acids Res 17:477-508; Griswold 
et al., (2003) Protein Expr. Purif. 27(1): 134-42) can be prepared, for example, to increase 
the rate of translation or to produce recombinant RNA transcripts having desirable 
properties, such as a longer half-life, as compared with transcripts produced from a non- 
25 optimized sequence. Translation stop codons can also be modified to reflect host 

preference. For example, preferred stop codons for 5. cerevisiae and mammals are UAA 
and UGA respectively. The preferred stop codon for monocotyledonous plants is UGA, 
whereas insects and E. coli prefer to use UAA as the stop codon (Dalphin, M.E. et al. 
(1996) NucL Acids Res. 24:216-218). 
30 The polynucleotide sequences of the present invention can be engineered in order to 

alter a coding sequence of the invention for a variety of reasons, including but not limited 
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to, alterations which modify the cloning, processing and/or expression of the gene 
product. For example, alterations may be introduced using techniques which are well 
known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylation patterns, to introduce or remove attachment groups (e.g., for pegylation or 
5 other conjugation), to change codon preference, to introduce splice sites, etc. 

Silent Variations 

Because of the degeneracy of the genetic code, a large number of functionally 
identical nucleic acids encode any given polypeptide. For instance, inspection of the 
codon table below (Table 5) shows that codons AGA, AGG, CGA, CGC, CGG, and CGU 
10 all encode the amino acid arginine. Thus, at every position in a nucleic acid sequence 
where an arginine is specified by a codon, the codon can be altered to any of the 
corresponding codons described above without altering the encoded polypeptide. Such 
nucleic acid variations are "silent variations". It is to be understood that U in an RNA 
sequence corresponds to T in a DNA sequence. 

15 

Table 5 
Codon Table 



Amino acid 


Codon(s) 


Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 


Cysteine 


Cys 


C 


UGC 


UGU 






Aspartic acid 


Asp 


D 


GAC 


GAU 






Glutamic acid 


Glu 


E 


GAA 


GAG 






Phenylalanine 


Phe 


F 


uuc 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 


De 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


CUC CUG CUU 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 






Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 
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Amino acid 


Codon(s) 


Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


ucu 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











It will thus be appreciated by those skilled in the art that due to the degeneracy of the 
genetic code, a multitude of nucleic acids sequences encoding polypeptides of the 
invention may be produced, some of which may bear minimal sequence identity to the 
nucleic acid sequences explicitly disclosed herein. One of ordinary skill in the art will 
recognize that each codon in a nucleic acid (except AUG and UGC, which are ordinarily 
the only codon for methionine and tryptophan, respectively) can be modified by standard 
techniques to encode a functionally identical polypeptide. Accordingly, each silent 
variation of a nucleic acid which encodes a polypeptide is implicit in any described 
sequence. The invention also provides each and every possible variation of a nucleic acid 
sequence encoding a polypeptide of the invention that can be made by selecting 
combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet (codon) genetic code (e.g., as set forth in Table 5), as 
applied to the nucleic acid sequence encoding a polypeptide of the invention. All such 
variations of every nucleic acid herein are specifically provided and described by 
consideration of the sequence in combination with the genetic code. One of skill is fully 
able to generate any silent substitution of the sequences listed herein. 

Using Polynucleotides 

The polynucleotides of the invention have a variety of uses in, for example, 
recombinant production (i.e., expression) of the polypeptides of the invention typically 
through expression of a plasmid expression vector comprising a sequence encoding the 
polypeptide or fragment thereof; as therapeutics; as prophylactics; as diagnostic tools; as 
immunogens; as adjuvants; as diagnostic probes for the presence of complementary or 
partially complementary nucleic acids (including for detection of a wild-type interferon- 



97 



WO 2005/113592 



PCT/US2005/017471 



alpha nucleic acid), as substrates for further reactions, e.g., recursive sequence 
recombination reactions or mutation reactions to produce new and/or improved variants, 
and the like. 

Vectors, Promoters, and Expression Systems 

5 The present invention also includes recombinant constructs comprising one or more 

of the nucleic acid sequences as broadly described above. The constructs comprise a 
vector, such as, a plasmid, a cosmid, a phage, a virus, a bacterial artificial chromosome 
(BAC), a yeast artificial chromosome (YAC), and the like, into which a nucleic acid 
sequence of the invention has been inserted, in a forward or reverse orientation. In some 

10 instances, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the nucleic acid sequence. Large numbers of suitable 
vectors and promoters are known to those of skill in the art, and are commercially 
available. 

General texts that describe molecular biological techniques useful herein, including 

15 the use of vectors, promoters and many other relevant topics, include Berger, supra; 

Sambrook (1989), supra, and Ausubel, supra. Examples of techniques sufficient to direct 
persons of skill through in vitro amplification methods, including the polymerase chain 
reaction (PCR) the ligase chain reaction (LCR), Qp-replicase amplification and other 
RNA polymerase mediated techniques (e.g., NASBA), e.g., for the production of the 

20 homologous nucleic acids of the invention are found in Berger, Sambrook, and Ausubel, 
all supra, as well as Mullis et al. (1987) U.S. Patent No. 4,683,202; PCR Protocols: A 
Guide to Methods and Applications (Innis et al., eds.) Academic Press Inc. San Diego, 
CA (1990) ("Innis"); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal 
Of NIH Research (1991) 3:81-94; (Kwoh et al. (1989) Proc Natl Acad Sci USA 86:1173- 

25 1 177; Guatelli et al. (1990) Proc Natl Acad Sci USA 87: 1874-1878; Lomeli et al. (1989) J 
Clin Chem 35:1826-1831; Landegren et al. (1988) Science 241:1077-1080; Van Brunt 
(1990) Biotechnology 8:291-294; Wu and Wallace (1989) Gene 4:560-569; Barringer et 
al. (1990) Gene 89:117-122, and Sooknanan and Malek (1995) Biotechnology 13:563- 
564. Improved methods of cloning in vitro amplified nucleic acids are described in 

30 Wallace et al., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic 
acids by PCR are summarized in Cheng et al. (1994) Nature 369:684-685 and the 
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references therein, in which PGR amplicons of up to 40 kilobases (kb) are generated. 
One of skill will appreciate that essentially any RNA can be converted into a double 
stranded DNA suitable for restriction digestion, PCR expansion and sequencing using 
reverse transcriptase and a polymerase. See Ausubel, Sambrook and Berger, all supra. 

5 The present invention also provides host cells that are transduced with vectors of the 

invention, and the production of polypeptides of the invention by recombinant techniques. 
Host cells are genetically engineered (e.g., transduced, transformed or transfected) with 
the vectors of this invention, which may be, for example, a cloning vector or an 
expression vector. The vector may be, for example, in the form of a plasmid, a viral 

10 particle, a phage, etc. The engineered host cells can be cultured in conventional nutrient 
media modified as appropriate for activating promoters, selecting transformants, or 
amplifying genes. The culture conditions, such as temperature, pH, and the like, are those 
previously used with the host cell selected for expression, and will be apparent to those 
skilled in the art and in the references cited herein, including, e.g., Freshney (1994) 

15 Culture of Animal Cells, a Manual of Basic Technique, third edition, Wiley- Liss, New 
York and the references cited therein. 

The polypeptides of the invention can also be produced in non-animal cells such as 
plants, yeast, fungi, bacteria and the like. In addition to Sambrook, Berger and Ausubel, 
details regarding cell culture are found in, e.g., Payne et al. (1992) Plant Cell and Tissue 

20 Culture in Liquid Systems John Wiley & Sons, Inc. New York, NY; Gamborg and 
Phillips (eds.) (1995) Plant Cell, Tissue and Organ Culture; Fundamental Methods 
Springer Lab Manual, Springer-Verlag (Berlin Heidelberg NY); Atlas & Parks (eds.) The 
Handbook of Microbiological Media (1993) CRC Press, Boca Raton, FL. 

The polynucleotides of the present invention and fragments thereof may be included 

25 in any one of a variety of expression vectors for expressing a polypeptide. Such vectors 
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives 
of SV40, bacterial plasmids, phage DNA, baculovirus, yeast plasmids, vectors derived 
from combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, 
fowl pox virus, pseudorabies, adeno-associated virus, retroviruses and many others. Any 

30 vector that transduces genetic material into a cell, and, if replication is desired, which is 
replicable and viable in the relevant host can be used. 
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The nucleic acid sequence in the expression vector is operatively linked to an 
appropriate transcription control sequence (promoter) to direct mRNA synthesis. 
Examples of such promoters include: LTR or SV40 promoter, E. coli lac or trp promoter, 
phage lambda P L promoter, CMV promoter, and other promoters known to control 
expression of genes in prokaryotic or eukaryotic cells or their viruses. The expression 
vector also contains a ribosome binding site for translation initiation, and a transcription 
terminator. The vector optionally includes appropriate sequences for amplifying 
expression, e.g., an enhancer. In addition, the expression vectors optionally comprise one 
or more selectable marker genes to provide a phenotypic trait for selection of transformed 
host cells, such as dihydrofolate reductase or neomycin resistance for eukaryotic cell 
culture, or such as tetracycline or ampicillin resistance in E. coli. 

The vector containing the appropriate DNA sequence encoding a polypeptide of the 
invention, as well as an appropriate promoter or control sequence, may be employed to 
transform an appropriate host to permit the host to express the polypeptide. Examples of 
appropriate expression hosts include: bacterial cells, such as E. coli, Streptomyces, and 
Salmonella typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, 
and Neurospora crassa\ insect cells such as Drosophila and Spodoptera frugiperda\ 
mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes melanoma; plant cells, 
etc. It is understood that not all cells or cell lines need to be capable of producing fully 
functional polypeptides of the invention or fragments thereof; for example, antigenic 
fragments of the polypeptide may be produced in a bacterial or other expression system. 
The invention is not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected depending 
upon the use intended for the polypeptide or fragment thereof. For example, when large 
quantities of a polypeptide or fragments thereof are needed for the induction of 
antibodies, vectors which direct high level expression of fusion proteins that are readily 
purified may be desirable. Such vectors include, but are not limited to, multifunctional E. 
coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the 
nucleotide coding sequence may be ligated into the vector in-frame with sequences for 
the amino-terminal Met and the subsequent 7 residues of beta-galactosidase so that a 
hybrid protein is produced; pIN vectors (Van Heeke & Schuster (1989) J Biol Chem 
264:5503-5509); pET vectors (Novagen, Madison WI); and the like. 
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Similarly, in the yeast Saccharomyces cerevisiae a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH may be 
used for production of the polypeptides of the invention. For reviews, see Ausubel, 
supra, Berger, supra, and Grant et al. (1987) Methods in Enzymology 153:516-544. 

In mammalian host cells, a number of expression systems, such as viral-based 
systems, may be utilized. In cases where an adenovirus is used as an expression vector, a 
coding sequence is optionally ligated into an adenovirus transcription/translation complex 
consisting of the late promoter and tripartite leader sequence. Insertion in a nonessential 
El or E3 region of the viral genome results in a viable virus capable of expressing a 
polypeptide of the invention in infected host cells (Logan and Shenk (1984) Proc Natl 
Acad Sci USA 81:3655-3659). In addition, transcription enhancers, such as the rous 
sarcoma virus (RSV) enhancer, are used to increase expression in mammalian host cells. 
Host cells, media, expression systems, and methods of production include those known 
for cloning and expression of various mammalian interferon-alphas (e.g., human 
interferon-alphas). 

Additional Expression Elements 

Specific initiation signals can aid in efficient translation of a polynucleotide coding 
sequence of the invention and/or fragments thereof. These signals can include, e.g., the 
ATG initiation codon and adjacent sequences. In cases where an coding sequence, its 
initiation codon and upstream sequences are inserted into the appropriate expression 
vector, no additional translational control signals may be needed. However, in cases 
where only coding sequence (e.g., a mature protein coding sequence), or a portion 
thereof, is inserted, exogenous nucleic acid transcriptional control signals including the 
ATG initiation codon must be provided. Furthermore, the initiation codon must be in the 
correct reading frame to ensure transcription of the entire insert. Exogenous 
transcriptional elements and initiation codons can be of various origins, both natural and 
synthetic. The efficiency of expression can enhanced by the inclusion of enhancers 
appropriate to the cell system in use (see, e.g., Scharf D. et al. (1994) Results Probl Cell 
Differ 20:125-62; and Bittner et al. (1987) Methods in Enzymol 153:516-544). 
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Secretion/Localization Sequences 

Polynucleotides encoding polypeptides of the invention can also be fused, for 
example, in-frame to nucleic acid encoding a secretion/localization sequence, to target 
polypeptide expression to a desired cellular compartment, membrane, or organelle, or to 

5 direct polypeptide secretion to the periplasmic space or into the cell culture media. Such 
sequences are known to those of skill, and include secretion leader or signal peptides, 
organelle targeting sequences (e.g., nuclear localization sequences, ER retention signals, 
mitochondrial transit sequences, chloroplast transit sequences), membrane 
localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and 

10 the like. 

Expression Hosts 

In a further aspect, the present invention relates to host cells containing any of the 
above-described nucleic acids, vectors, or other constructs of the invention. The host cell 
can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host 

15 cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into 
the host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated 
transfection, electroporation, gene or vaccine gun, injection, or other common techniques 
(see, e.g., Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular 
Biology) for in vivo, ex vivo or in vitro methods. 

20 A host cell strain is optionally chosen for its ability to modulate the expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the protein include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation and acylation. Post-translational processing 
which cleaves a "pre" or a "prepro" form of the protein may also be important for correct 

25 insertion, folding and/or function. Different host cells such as E. coli, Bacillus sp., yeast 
or mammalian cells such as CHO, HeLa, BHK, MDCK, HEK 293, WI38, etc. have 
specific cellular machinery and characteristic mechanisms for such post-translational 
activities and may be chosen to ensure the correct modification and processing of the 
introduced foreign protein. 

30 Stable expression can be used for long-term, high-yield production of recombinant 

proteins. For example, cell lines which stably express a polypeptide of the invention are 
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transduced using expression vectors which contain viral origins of replication or 
endogenous expression elements and a selectable marker gene. Following the 
introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched 
media before they are switched to selective media. The purpose of the selectable marker 
5 is to confer resistance to selection, and its presence allows growth and recovery of cells 
which successfully express the introduced sequences. For example, resistant clumps of 
stably transformed cells can be proliferated using tissue culture techniques appropriate to 
the cell type. 

Host cells transformed with a nucleotide sequence encoding a polypeptide of the 
10 invention are optionally cultured under conditions suitable for the expression and 
recovery of the encoded protein from cell culture. The polypeptide produced by a 
recombinant cell may be secreted, membrane-bound, or contained intracellularly, 
depending on the sequence and/or the vector used. As will be understood by those of 
skill in the art, expression vectors containing polynucleotides encoding polypeptides of 
15 the invention can be designed with signal sequences which direct secretion of the mature 
polypeptides through a prokaryotic or eukaryotic cell membrane. 

Additional Sequences 

The polynucleotides of the present invention optionally comprise a coding sequence 
fused in-frame to a marker sequence which, e.g., facilitates purification and/or detection 

20 of the encoded polypeptide. Such purification subsequences include, but are not limited 
to, metal chelating peptides such as histidine-tryptophan modules that allow purification 
on immobilized metals, a sequence which binds glutathione (e.g., GST), a hemagglutinin 
(HA) tag (corresponding to an epitope derived from the influenza hemagglutinin protein; 
Wilson, I. et al. (1984) Cell 37:767), maltose binding protein sequences, the FLAG 

25 epitope utilized in the FLAGS extension/affinity purification system, and the like. The 
inclusion of a protease-cleavable polypeptide linker sequence between the purification 
domain and the polypeptide sequence is useful to facilitate purification. 

For example, one expression vector possible to use in the compositions and methods 
described herein provides for expression of a fusion protein comprising a polypeptide of 

30 the invention fused to a polyhistidine region separated by an enterokinase cleavage site. 
The histidine residues facilitate purification on IMIAC (immobilized metal ion affinity 
chromatography, as described in Porath et al. (1992) Protein Expression and Purification 
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3:263-281) while the enterokinase cleavage site provides a method for separating the 
desired polypeptide from the polyhistidine region. pGEX vectors (Promega; Madison, 
WI) are optionally used to express foreign polypeptides as fusion proteins with 
glutathione S-transferase (GST). In general, such fusion proteins are soluble and can 
5 easily be purified from lysed cells by adsorption to ligand-agarose beads (e.g., 

glutathione-agarose in the case of GST-fusions) followed by elution in the presence of 
free ligand. 

An additional construction in the compositions and methods described herein 
provides for proteins, and their encoding nucleic acids, comprising polypeptides of the 

10 invention (or one or more fragments thereof), e.g., as described herein, fused to an Ig 
molecule, e.g., human IgG Fc ("fragment crystallizable," or fragment complement 
binding) hinge, CH2 domain and CH3 domain (and nucleotide sequences encoding them). 
Fc is the portion of the antibody responsible for binding to antibody receptors on cells and 
the Clq component of complement. These fusion proteins or fragments thereof and their 

15 encoding nucleic acids are optionally useful as prophylactic and/or therapeutic drugs or as 
diagnostic tools (see also, e.g., Challita-Eid, P. et al. (1998) J Immunol 160:3419-3426; 
Sturmhoefel, K. et al. (1999) Cancer Res 59:4964-4972). 

Polypeptide Production and Recovery 

Following transduction of a suitable host strain and growth of the host strain to an 
20 appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
Cells are typically harvested by centrifugation, disrupted by physical or chemical means, 
and the resulting crude extract retained for further purification. Eukaryotic or microbial 
cells employed in expression of the proteins can be disrupted by any convenient method, 
25 including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents, or other methods, which are well know to those skilled in the art. 

As noted, many references are available for the culture and production of many cells, 
including cells of bacterial, plant, animal (especially mammalian) and archebacterial 
origin. See, e.g., Sambrook, Ausubel, and Berger (all supra), as well as Freshney (1994) 
30 Culture of Animal Cells, a Manual of Basic Technique, third edition, Wiley-Liss, New 
York and the references cited therein; Doyle and Griffiths (1997) Mammalian Cell 
Culture: Essential Techniques John Wiley and Sons, NY; Humason (1979) Animal Tissue 
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Techniques, fourth edition W.H. Freeman and Company; and Ricciardelli et al. (1989) In 
vitro Cell Dev Biol 25:1016-1024. For plant cell culture and regeneration see, e.g., Payne 
et al. (1992) Plant Cell and Tissue Culture in Liquid Systems John Wiley & Sons, Inc. 
New York, NY; Gamborg and Phillips (eds.) (1995) Plant Cell, Tissue and Organ 
5 Culture; Fundamental Methods Springer Lab Manual, Springer- Verlag (Berlin Heidelberg 
New York) and Plant Molecular Biology (1993) R.R.D. Croy (ed.) Bios Scientific 
Publishers, Oxford, U.K. ISBN 0 12 198370 6. Cell culture media in general are set forth 
in Atlas and Parks (eds.) The Handbook of Microbiological Media (1993) CRC Press, 
Boca Raton, FL. Additional information for cell culture is found in available commercial 

10 literature such as the Life Science Research Cell Culture Catalogue from Sigma- Aldrich, 
Inc (St Louis, MO) ("Sigma-LSRCCC") and, e.g., the Plant Culture Catalogue and 
supplement also from Sigma- Aldrich, Inc (St Louis, MO) ("Sigma-PCCS"). 

Polypeptides of the invention can be recovered and purified from recombinant cell 
cultures by any of a number of methods well known in the art, including ammonium 

15 sulfate or ethanol precipitation, acid extraction, anion or cation exchange 

chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography (e.g., using any of the tagging systems noted 
herein), hydroxylapatite chromatography, and lectin chromatography. Protein refolding 
steps can be used, as desired, in completing configuration of the mature protein or 

20 fragments thereof. Finally, high performance liquid chromatography (HPLC) can be 
employed in the final purification steps. In addition to the references noted, supra, a 
variety of purification methods are well known in the art, including, e.g., those set forth in 
Sandana (1997) Bioseparation of Proteins, Academic Press, Inc.; Bollag et al. (1996) 
Protein Methods, 2 nd Edition Wiley-Liss, NY; Walker (1996) The Protein Protocols 

25 Handbook Humana Press, NJ; Harris and Angal (1990) Protein Purification Applications: 
A Practical Approach IRL Press at Oxford, Oxford, England; Harris and Angal Protein 
Purification Methods: A Practical Approach IRL Press at Oxford, Oxford, England; 
Scopes (1993) Protein Purification: Principles and Practice 3 rd Edition Springer Verlag, 
NY; Janson and Ryden (1998) Protein Purification: Principles, High Resolution Methods 

30 and Applications, Second Edition Wiley- VCH, NY; and Walker (1998) Protein Protocols 
on CD-ROM Humana Press, NJ. 
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In vitro Expression Systems 

Cell-free transcription/translation systems can also be employed to produce 
polypeptides of the invention using polynucleotides of the present invention. Several 
such systems are commercially available. A general guide to in vitro transcription and 
translation protocols is found in Tymms (1995) In vitro Transcription and Translation 
Protocols: Methods in Molecular Biology Volume 37, Garland Publishing, NY. 

In vivo Uses and Applications 

Polynucleotides that encode a polypeptide of the invention, or complements of the 
polynucleotides (including e.g., antisense or ribozyme molecules), are optionally 
administered to a cell to accomplish a therapeutically useful process or to express a 
therapeutically useful product. These in vivo applications, including gene therapy, 
include a multitude of techniques by which gene expression may be altered in cells. Such 
methods include, for instance, the introduction of genes for expression of, e.g., 
therapeutically and/or prophylactically useful polypeptides, such as the polypeptides of 
the present invention. 

In vivo Polypeptide Expression 

Polynucleotides encoding polypeptides of the invention are particularly useful for in 
vivo therapeutic applications, using techniques well known to those skilled in the art. For 
example, cultured cells are engineered ex vivo with at least one polynucleotide (DNA or 
RNA) of the invention and/or other polynucleotide sequences encoding, e.g., at least one 
of an antigen, cytokine, other co-stimulatory molecule, adjuvant, etc., and the like, with 
the engineered cells then being returned to the patient. Cells may also be engineered in 
vivo for expression of one or more polypeptides in vivo, including polypeptides and/or 
antigenic peptides of the invention. 

A number of viral vectors suitable for organismal in vivo transduction and 
expression are known. Such vectors include retroviral vectors (see, e.g., Miller, Curr Top 
Microbiol Immunol (1992) 158:1-24; Salmons and Gunzburg (1993) Human Gene 
Therapy 4:129-141; Miller et al. (1994) Methods in Enzymology 217:581-599) and 
adeno-associated vectors (reviewed in Carter (1992) Curr Opinion Biotech 3:533-539; 
Muzcyzka (1992) Curr Top Microbiol Immunol. 158:97-129). Other viral vectors that are 
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used include adenoviral vectors, herpes viral vectors and Sindbis viral vectors, as 
generally described in, e.g., Jolly (1994) Cancer Gene Therapy 1:51-64; Latchman (1994) 
Molec Biotechnol 2:179-195; and Johanning et ah (1995) Nucl Acids Res 23:1495-1501. 
In one aspect, a pox virus vector can be used. The pox viral vector is transfected 
5 with a polynucleotide sequence encoding a polypeptide of the invention, such as an eIL-2 
polypeptide, and is useful in prophylactic, therapeutic and diagnostic applications where 
enhancement of an immune response, such as e.g., increased or improved T cell 
proliferation is desired. See viral vectors discussed in, e.g., Berencsi et al., J Infect Dis 
(2001)183(8): 1 171-9; Rosenwirth et al., Vaccine 2001 Feb 8;19(13-14):1661-70; 

10 Kittlesen et al., J Immunol (2000) 164(8):4204-1 1; Brown et al. Gene Ther 2000 

7(19):1680-9; Kanesa-thasan et al., Vaccine (2000) 19(4-5):483-91; Sten (2000) Drug 
60(2):249-71. Compositions comprising such vectors and an acceptable excipient are 
also a feature of the invention. 

Gene therapy and genetic vaccines provide methods for combating chronic 

15 infectious diseases (e.g., HIV infection, viral hepatitis), as well as non-infectious diseases 
including cancer and some forms of congenital defects such as enzyme deficiencies, and 
such methods can be employed with polynucleotides of the invention, including, e.g., 
vectors and cells comprising such polynucleotides. Several approaches for introducing 
nucleic acids and vectors into cells in vivo, ex vivo and in vitro have been used and can be 

20 employed with polynucleotides of the invention, and vectors comprising such 

polynucleotides. These approaches include liposome based gene delivery (Debs and Zhu 
(1993) WO 93/24640 and U.S. Pat No. 5,641,662; Mannino and Gould-Fogerite (1988) 
BioTechniques 6(7):682-691; Rose, U.S. Pat. No. 5,279,833; Brigham (1991) WO 
91/06309; and Feigner et al. (1987) Proc Natl Acad Sci USA 84:7413-7414; Brigham et 

25 al. (1989) Am J Med Sci 298:278-281; Nabel et al. (1990) Science 249:1285-1288; 
Hazinski et aL (1991) Am J Resp Cell Molec Biol 4:206-209; and Wang and Huang 
(1987) Proc Natl Acad Sci USA 84:7851-7855); adenoviral vector mediated gene 
delivery, e.g., to treat cancer (see, e.g., Chen et al. (1994) Proc Natl Acad Sci USA 
91:3054-3057; Tong et al. (1996) Gynecol Oncol 61:175-179; Clayman et al. (1995) 

30 Cancer Res. 5:1-6; O'Malley et al. (1995) Cancer Res 55:1080-1085; Hwang et al. (1995) 
Am J Respir Cell Mol Biol 13:7-16; Haddada et al. (1995) Curr Top Microbiol Immunol. 
1995 (Pt. 3):297-306; Addison et al. (1995) Proc Natl Acad Sci USA 92:8522-8526; 
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Colak et al. (1995) Brain Res 691:76-82; Crystal (1995) Science 270:404-410; Elshami et 
al. (1996) Human Gene Ther 7:141-148; Vincent et al. (1996) J Neurosurg 85:648-654), 
and many others. Replication-defective retroviral vectors harboring therapeutic 
polynucleotide sequence as part of the retroviral genome have also been used, particularly 

5 with regard to simple MuLV vectors. See, e.g., Miller et al. (1990) Mol Cell Biol 

10:4239 (1990); Kolberg (1992) J NIH Res 4:43, and Cornetta et al. (1991) Hum Gene 
Ther 2:215). Nucleic acid transport coupled to ligand-specific, cation-based transport 
systems (Wu and Wu (1988) J Biol Chem, 263: 14621-14624) has also been used. Naked 
DNA expression vectors have also been described (Nabel et al. (1990), supra); Wolff et 

10 al. (1990) Science, 247: 1465-1468). In general, these approaches can be adapted to the 
invention by incorporating nucleic acids encoding the polypeptides of the invention into 
the appropriate vectors. 

General texts which describe gene therapy protocols, which can be adapted to the 
present invention by introducing the nucleic acids of the invention into patients, include, 

15 e.g., Robbins (1996) Gene Therapy Protocols, Humana Press, NJ, and Joyner (1993) 
Gene Targeting: A Practical Approach, IRL Press, Oxford, England. 

Antisense Technology 

In addition to expression of the nucleic acids of the invention as gene replacement 
nucleic acids, the nucleic acids are also useful for sense and anti-sense suppression of 

20 expression, e.g., to down-regulate expression of a nucleic acid of the invention, once, or 
when, expression of the nucleic acid is no-longer desired in the cell. Similarly, the 
nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can also 
be used to block expression of naturally occurring homologous nucleic acids. A variety 
of sense and anti-sense technologies are known in the art, e.g., as set forth in Lichtenstein 

25 and Nellen (1997) Antisense Technology: A Practical Approach IRL Press at Oxford 
University, Oxford, England, and in Agrawal (1996) Antisense Therapeutics Humana 
Press, NJ, and the references cited therein. 

Use as Probes 

Also contemplated are uses of polynucleotides, also referred to herein as 
30 oligonucleotides, typically having at least 12 bases, preferably at least 15, more 
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preferably at least 20, at least 30, or at least 50 or more bases, which hybridize under 
highly stringent conditions to a polynucleotide of the invention, or fragments thereof. 
The polynucleotides may be used as probes, primers, sense and antisense agents, and the 
like, according to methods as noted supra. 

Nucleic Acid Hybridization 

Nucleic acids "hybridize" when they associate, typically in solution. Nucleic acids 
hybridize due to a variety of well characterized physico-chemical forces, such as 
hydrogen bonding, solvent exclusion, base stacking and the like. An extensive guide to 
the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in 
Biochemistry and Molecular Biology—Hybridization with Nucleic Acid Probes, part I, 
chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid probe 
assays," (Elsevier, New York) (hereinafter "Tjissen"), as well as in Ausubel, supra,. 
Hames and Higgins (1995) Gene Probes 1, IRL Press at Oxford University Press, Oxford, 
England (Hames and Higgins 1) and Hames and Higgins (1995) Gene Probes 2, IRL 
Press at Oxford University Press, Oxford, England (Hames and Higgins 2) provide details 
on the synthesis, labeling, detection and quantification of DNA and RNA, including 
oligonucleotides. 

An indication that two nucleic acid sequences are substantially identical is that the 
two molecules hybridize to each other under at least stringent conditions. The phrase 
"hybridizing specifically to," refers to the binding, duplexing, or hybridizing of a 
molecule only to a particular nucleotide sequence under stringent conditions when that 
sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) 
substantially" refers to complementary hybridization between a probe nucleic acid and a 
target nucleic acid and embraces minor mismatches that can be accommodated by 
reducing the stringency of the hybridization media to achieve the desired detection of the 
target polynucleotide sequence. 

"Stringent hybridization wash conditions" and "stringent hybridization conditions" 
in the context of nucleic acid hybridization experiments, such as Southern and northern 
hybridizations, are sequence dependent, and are different under different environmental 
parameters. An extensive guide to hybridization of nucleic acids is found in Tijssen 
(1993), supra, and in Hames and Higgins 1 and Hames and Higgins 2, supra. 
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For purposes of the present invention, generally, "highly stringent" hybridization and 
wash conditions are selected to be about 5° C or less lower than the thermal melting point 
(T m ) for the specific sequence at a defined ionic strength and pH (as noted below, highly 
stringent conditions can also be referred to in comparative terms). The T m is the 
5 temperature (under defined ionic strength and pH) at which 50% of the test sequence 

hybridizes to a perfectly matched probe. In other words, the T m indicates the temperature 
at which the nucleic acid duplex is 50% denatured under the given conditions and its 
represents a direct measure of the stability of the nucleic acid hybrid. Thus, the T m 
corresponds to the temperature corresponding to the midpoint in transition from helix to 

10 random coil; it depends on length, nucleotide composition, and ionic strength for long 
stretches of nucleotides. Typically, under "stringent conditions," a probe will hybridize 
to its target subsequence, but to no other sequences. "Very stringent conditions" are 
selected to be equal to the T m for a particular probe. 

After hybridization, unhybridized nucleic acid material can be removed by a series 

15 of washes, the stringency of which can be adjusted depending upon the desired results. 
Low stringency washing conditions (e.g., using higher salt and lower temperature) 
increase sensitivity, but can product nonspecific hybridization signals and high 
background signals. Higher stringency conditions (e.g., using lower salt and higher 
temperature that is closer to the hybridization temperature) lowers the background signal, 

20 typically with only the specific signal remaining. See, Rapley, R. and Walker, J.M. eds., 
Molecular Biomethods Handbook (Humana Press, Inc. 1998) (hereinafter "Rapley and 
Walker"), which is incorporated herein by reference in its entirety for all purposes. 

The T m of a DNA-DNA duplex can be estimated using equation (1): 
T m (°C) = 81.5°C + 16.6 (logioM) + 0.41 (%G + C) - 0.72 (%f) - 500/n, 

25 where M is the molarity of the monovalent cations (usually Na+), (%G + C) is the 

percentage of guanosine (G) and cystosine (C ) nucleotides, (%f) is the percentage of 
formalize and n is the number of nucleotide bases (i.e., length) of the hybrid. See, Rapley 
and Walker, supra. 

The T m of an RNA-DNA duplex can be estimated using equation (2): 

30 T m (°C) = 79.8°C + 18.5 (log 10 M) + 0.58 (%G + C) - 1 1.8(%G + C) 2 - 0.56 (%f) - 820/n, 
where M is the molarity of the monovalent cations (usually Na+), (%G + C)is the 
percentage of guanosine (G ) and cystosine (C ) nucleotides, (%f) is the percentage of 
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formamide and n is the number of nucleotide bases (i.e., length) of the hybrid. IcL 
Equations 1 and 2 above are typically accurate only for hybrid duplexes longer than about 
100-200 nucleotides. Id. 

The Tm of nucleic acid sequences shorter than 50 nucleotides can be calculated as 
5 follows: 

T m (°C) = 4(G + C) + 2(A + T), where A (adenine), C, T (thymine), and G are the 
numbers of the corresponding nucleotides. 

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acids which have more than 100 complementary residues on a 

10 filter in a Southern or northern blot is 50% formalin (or formamide) with 1 mg of heparin 
at 42°C, with the hybridization being carried out overnight. An example of stringent 
wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook, supra, for a 
description of SSC buffer). Often, the high stringency wash is preceded by a low 
stringency wash to remove background probe signal. An example low stringency wash is 

15 2x SSC at 40°C for 15 minutes. An example of highly stringent wash conditions is 

0.1 5M NaCl at 72°C for about 15 minutes. An example medium stringency wash for a 
duplex of, e.g., more than 100 nucleotides, is lx SSC at 45°C for 15 minutes. An 
example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6x 
SSC at 40°C for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent 

20 conditions typically involve salt concentrations of less than about 1.0 M Na + ion, typically 
about 0.01 to 1.0 M Na + ion concentration (or other salts) at pH 7.0 to 8.3, and the 
temperature is typically at least about 30°C. Stringent conditions can also be achieved 
with the addition of destabilizing agents such as formamide. 

In general, a signal to noise ratio of 2x or 2.5x-5x (or higher) than that observed for 

25 an unrelated probe in the particular hybridization assay indicates detection of a specific 
hybridization. Detection of at least stringent hybridization between two sequences in the 
context of the present invention indicates relatively strong structural similarity or 
homology to, e.g., the nucleic acids of the present invention provided in the sequence 
listings herein. 

30 As noted, "highly stringent" conditions are selected to be about 5° C or less lower 

than the thermal melting point (T m ) for the specific sequence at a defined ionic strength 
and pH. Target sequences that are closely related or identical to the nucleotide sequence 
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of interest (e.g., "probe") can be identified under highly stringency conditions. Lower 
stringency conditions are appropriate for sequences that are less complementary. See, 
e.g., Rapley and Walker; Sambrook, all supra. 

Comparative hybridization can be used to identify nucleic acids of the invention, and 
this comparative hybridization method is a preferred method of distinguishing nucleic 
acids of the invention. Detection of highly stringent hybridization between two 
nucleotide sequences in the context of the present invention indicates relatively strong 
structural similarity/homology to, e.g., the nucleic acids provided in the sequence listing 
herein. Highly stringent hybridization between two nucleotide sequences demonstrates a 
degree of similarity or homology of structure, nucleotide base composition, arrangement 
or order that is greater than that detected by stringent hybridization conditions. In 
particular, detection of highly stringent hybridization in the context of the present 
invention indicates strong structural similarity or structural homology (e.g., nucleotide 
structure, base composition, arrangement or order) to, e.g., the nucleic acids provided in 
the sequence listings herein. For example, it is desirable to identify test nucleic acids 
which hybridize to the exemplar nucleic acids herein under stringent conditions. 

Thus, one measure of stringent hybridization is the ability to hybridize to one of the 
listed nucleic acids of the invention (e.g., nucleic acid sequences SEQ ID NOS:59-88, and 
complementary polynucleotide sequences thereof) under highly stringent conditions (or 
very stringent conditions, or ultra-high stringency hybridization conditions, or ultra-ultra 
high stringency hybridization conditions). Stringent hybridization (including, e.g., highly 
stringent, ultra-high stringency, or ultra-ultra high stringency hybridization conditions) 
and wash conditions can easily be determined empirically for any test nucleic acid. 

For example, in determining highly stringent hybridization and wash conditions, the 
hybridization and wash conditions are gradually increased (e.g., by increasing 
temperature, decreasing salt concentration, increasing detergent concentration and/or 
increasing the concentration of organic solvents, such as formalin, in the hybridization or 
wash), until a selected set of criteria are met. For example, the hybridization and wash 
conditions are gradually increased until a probe comprising one or more nucleic acid 
sequences selected from SEQ ID NOS:59-88, and complementary polynucleotide 
sequences thereof, binds to a perfectly matched complementary target (again, a nucleic 
acid comprising one or more nucleic acid sequences selected from SEQ ID NOS: 59-88, 
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and complementary polynucleotide sequences thereof), with a signal to noise ratio that is 
at least 2.5x, and optionally 5x or more as high as that observed for hybridization of the 
probe to an unmatched target. In this case, the unmatched target is a nucleic acid 
corresponding to, e.g., a known interferon-alpha nucleic acid sequence (e.g., an 

5 interferon-alpha nucleic acid sequence present in a public database such as GenBank or 
GENESEQ at the time of filing of the subject application). 

A test nucleic acid is said to specifically hybridize to a probe nucleic acid when it 
hybridizes at least Vi as well to the probe as to the perfectly matched complementary 
target, i.e., with a signal to noise ratio at least Vz as high as hybridization of the probe to 

10 the target under conditions in which the perfectly matched probe binds to the perfectly 
matched complementary target with a signal to noise ratio that is at least about 2.5x-10x, 
typically 5x-10x as high as that observed for hybridization to any of the unmatched target 
nucleic acids such as, e.g., a known interferon-alpha nucleic acid sequence as set forth 
above. For some such nucleic acids, the stringent conditions are selected such that a 

15 perfectly complementary oligonucleotide to the coding oligonucleotide hybridizes to the 
coding oligonucleotide with at least about a 5x higher signal to noise ratio than for 
hybridization of the perfectly complementary oligonucleotide to a control nucleic acid 
corresponding to a known interferon-alpha sequence as set forth above. 

Ultra high-stringency hybridization and wash conditions are those in which the 

20 stringency of hybridization and wash conditions are increased until the signal to noise 

ratio for binding of the probe to the perfectly matched complementary target nucleic acid 
is at least lOx as high as that observed for hybridization to any of the unmatched target 
nucleic acids, such as, e.g., a known interferon-alpha nucleic acid sequence as set forth 
above. A target nucleic acid which hybridizes to a probe under such conditions, with a 

25 signal to noise ratio of at least Vi that of the perfectly matched complementary target 
nucleic acid is said to bind to the probe under ultra-high stringency conditions. 

Similarly, even higher levels of stringency can be determined by gradually 
increasing the hybridization and/or wash conditions of the relevant hybridization assay. 
For example, those in which the stringency of hybridization and wash conditions are 

30 increased until the signal to noise ratio for binding of the probe to the perfectly matched 
complementary target nucleic acid is at least lOx, 20x, 50x, lOOx, or 500x or more as high 
as that observed for hybridization to any of the unmatched target nucleic acids, such as , 
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e.g., a known interferon-alpha nucleic acid sequence as set forth above. A target nucleic 
acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at 
least Vi that of the perfectly matched complementary target nucleic acid is said to bind to 
the probe under ultra-ultra-high stringency conditions. 
5 Target nucleic acids which hybridize to one or more nucleic acids represented by 

SEQ ID NOS:59-88 under high, ultra-high and ultra-ultra high stringency conditions are 
a feature of the invention. Examples of such nucleic acids include those with one or a 
few silent or conservative nucleic acid substitutions as compared to a given nucleic acid 
sequence. 

10 

THERAPEUTIC USES 

Various interferon-alpha polypeptides and interferon-alpha conjugates have been 
approved or are currently in clinical development for treatment of a variety of conditions 
such as Chronic Hepatitis C, Chronic Hepatitis B, Hairy Cell Leukemia, Malignant 

15 Melanoma, Follicular Lymphoma, Condylomata Acuminata, ABDS-related Kaposi's 
Sarcoma, Non-Hodgkin's Lymphoma, Chronic Melogenous Leukemia, Basal Cell 
Carcinoma, Multiple Myeloma, carcinoid tumors, bladder cancer, Crohn's Disease, 
Cutaneous T Cell Lymphoma, Renal Cell Carcinoma, Multiple Sclerosis, and AIDS. 
Accordingly, the present invention includes a method of treating a condition which is 

20 responsive to interferon-alpha, such as for example a condition described above, 
comprising administering to a subject afflicted with the condition a composition 
comprising a polypeptide of the invention or a conjugate of the invention in an amount 
effective to ameliorate a symptom associated with the condition. The invention also 
includes the use of a composition comprising a polypeptide of the invention or a 

25 conjugate of the invention (i.e., a "composition of the invention") to treat a condition 

which is responsive to interferon-alpha, such as a condition described above, or any other 
condition which is responsive to a polypeptide of the invention or a conjugate of the 
invention. 

Treatment of Viral Infections and Conditions Associated with Viral Infection 
30 In one aspect, the invention provides a method for treating a subject infected with a 

virus, comprising administering to the subject a composition of the invention in an 
amount effective to decrease the level of the virus in the subject and/or to ameliorate a 
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symptom or condition associated with the viral infection. Exemplary viral infections 
contemplated for treatment methods of the invention include, but are not limited to, 
infection by a virus of the Flaviviridae family, such as, for example, Hepatitis C Virus, 
Yellow Fever Virus, West Nile Virus, Japanese Encephalitis Virus, Dengue Virus, or 

5 Bovine Viral Diarrhea Virus; infection by a virus of the Hepadnaviridae family, such as, 
for example, Hepatitis B Virus; infection by a virus of the Picornaviridae family, such as, 
for example, Encephalomyocarditis Virus, Human Rhinovirus, or Hepatitis A Virus; 
infection by a virus of the Retroviridae family, such as, for example, Human 
Immunodeficiency Virus, Simian Immunodeficiency Virus, Human T-Lymphotropic 

10 Virus, or Rous Sarcoma Virus; infection by a virus of the Coronaviridae family, such as, 
for example, SARS coronavirus; infection by a virus of the Rhabdoviridae family, such 
as, for example, Rabies Virus or Vesicular Stomatitis Virus, infection by a virus of the 
Paramyxoviridae family, such as, for example, Respiratory Syncytial Virus or 
Parainfluenza Virus, infection by a virus of the Papillomaviridae family, such as, for 

15 example, Human Papillomavirus, and infection by a virus of the Herpesviridae family, 
such as, for example, Herpes Simplex Virus. 

The following provides non-limiting examples for treatment of exemplary viral 
infections and diseases and conditions associated with such infections using polypeptides 
and conjugates of the invention, including suggested dosing schedules for polypeptides 

20 and conjugates of the invention and approaches to monitoring the efficacy of such 

treatments. The dosing schedules of polypeptides or conjugates of the invention for the 
treatment of other viral infections and diseases and conditions associated with viral 
infections, and approaches to monitoring the efficacy of such treatments, is ascertainable 
by one skilled in the art. 

25 Hepatitis C Virus 

In one aspect the invention provides a method of treating a patient infected with 
Hepatitis C Virus (HCV), comprising administering to the patient an effective amount of 
a composition of the invention comprising one or more polypeptide or conjugate of the 
invention. The invention also provides a composition for use in treating a patient infected 
30 with HCV, comprising one or more polypeptide or conjugate of the invention and a 
pharmaceutically acceptable carrier or excipient. A patient diagnosed as infected with 
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HCV includes a patient exhibiting HCV RNA in the blood and/or exhibiting anti-HCV 
antibody in the serum. 

A composition comprising a polypeptide of the invention will generally be 
administered at a dose and frequency similar to what is employed in HCV therapeutic 
5 regimens using clinically-approved interferon-alpha polypeptides, such as, e.g. 

ROFERON®-A (Interferon alfa-2a, recombinant; Hoffmann-La Roche Inc.), INTRON® 
A (Interferon alfa-2b, recombinant; Schering Corporation), and INFERGEN® (interferon 
alfacon-1; InterMune, Inc.). Exemplary recommended dosing schedules of ROFERON 
or INTRON A for the treatment of chronic HCV is 3 million IU (approximately 15 

10 micrograms (meg)) three times a week by subcutaneous injection for, e.g., 24 to 48 

weeks. An exemplary recommended dosing schedule of INFERGEN for the treatment of 
chronic HCV is 9 meg three times a week by subcutaneous injection for, e.g., 24 to 48 
weeks. Depending on a number of factors (including but not limited to the activity and 
the pharmacokinetics of the polypeptide of the invention and the size and health of the 

15 patient), the polypeptide may be administered in lower amounts (such as, for example, 

about 2, 3, 4, 5, 6, 7, or 8 meg) and/or less frequently (such as once per week or twice per 
week) than described above. 

Likewise, a composition comprising a conjugate of the invention will generally be 
administered at a dose and frequency similar to what is employed in HCV therapeutic 

20 regimens using clinically-approved interferon-alpha conjugates, such as, e.g., 

PEGASYS® (Peginterferon alfa-2a; Hoffmann-La Roche, Inc.) or PEG-INTRON® 
(peginterferon alfa-2b; Schering Corporation). An exemplary recommended dosing 
schedule of PEGASYS for the treatment of chronic HCV is 180 meg once weekly by 
subcutaneous injection for, e.g., 24 to 48 weeks. Depending on a number of factors 

25 (including but not limited to the molecular weight, activity, and pharmacokinetics of the 
conjugate of the invention and the size and health of the patient), the conjugate may be 
administered in lower amounts (such as, for example, about 25, 50, 75, 100, 125, or 150 
meg) and/or less frequently (such as once every 10 days, or once every 2 weeks) than 
described above. 

30 In some instances the polypeptide or conjugate of the invention is administered in 

combination with one or more additional therapeutic agent(s). For example, the 
polypeptide or conjugate of the invention may be administered in combination with a 
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small-molecule antiviral drug such as Ribavirin, which is sold under the names 
COPEGUS® (Hoffmann-La Roche, Inc) and REBETOL® (Schering Corporation). 
Alternatively, or in addition to a small-molecule antiviral drug, the polypeptide or 
conjugate of the invention may be administered in combination with one or more 
5 additional cytokine, such as, for example, IFN-gamma, which is sold under the name 

Actimmune® (interferon gamma-lb; InterMune, Inc.), IL-2, which is sold under the name 
PROLEUKIN® IL-2 (aldesleukin recombinant human interleukin-2 (rhIL-2); Chiron 
Corp.), or IL-12 (interleukin-12). 

The precise amount and frequency of administration of the polypeptide or conjugate 

10 of the invention will depend on a number of factors such as the specific activity and the 
pharmacokinetic properties of the polypeptide or the conjugate, as well as the nature of 
the condition being treated (such as, the genotype of the Hepatitis C virus being treated), 
among other factors known to those of skill in the art. Normally, the dose should be 
capable of preventing or lessening the severity or spread of the indication being treated. 

15 Such a dose may be termed an "effective" or "therapeutically effective" amount. It will 
be apparent to those of skill in the art that an effective amount of a polypeptide, conjugate 
or composition of the invention depends, inter alia, upon the condition being treated, the 
dose, the administration schedule, whether the polypeptide or conjugate or composition is 
administered alone or in combination with other therapeutic agents, the serum half-life 

20 and other pharmacokinetic properties of the polypeptide, conjugate or composition, as 
well as the size, age, and general health of the patient. The dosage and frequency of 
administration is ascertainable by one skilled in the art using known techniques. 

The effectiveness of treatment may be determined by measuring viral load, for 
example by determining the titer or level of virus in serum or plasma using methods 

25 known in the art, such as, e.g., by monitoring viral RNA levels using quantitative PCR- 
based tests, such as the COBAS AMPLICOR® HCV Test, v2.0 or the COBAS 
AMPLICOR HCV MONITOR® Test, v2.0 (both from Roche Diagnostics). In some 
instances, an effective amount of a composition of the invention is one that is sufficient to 
achieve a reduction in viral load by at least 2 log units, at least 3 log units, at least 4 log 

30 units, at least 5 log units, at least 6 log units or at least 7 log units over the course of 

treatment, compared to the viral load prior to treatment (which is generally in the range of 
10 5 -10 7 copies of HCV RNA/ml for chronic HCV patients). In some instances an 
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effective amount of a composition of the invention is an amount that is sufficient to 
reduce viral load to levels which are essentially undetectable, such as, for example, less 
than about 500 copies/ml serum or less than about 100 copies/ml serum. The invention 
includes a method of reducing the level of HCV RNA in serum of a patient infected with 

5 HCV, comprising administering to the patient a composition of the invention in an 
amount effective to reduce the level of HCV RNA compared to the HCV RNA level 
present prior to the start of treatment. 

The effectiveness of treatment may alternatively or in addition be determined by 
measuring a parameter indicative of a condition associated with HCV infection, such as, 

10 e.g., liver damage. For example, the level of serum alanine aminotransferase (ALT) may 
be measured using a standard assay. In general, an ALT level of less than about 50 
international units/ml (IU/ml) serum is considered normal. A higher ALT level may be 
indicative of ongoing liver damage. In some instances, an effective amount of a 
composition of the invention is an amount effective to reduce ALT level, in a patient with 

15 a higher than normal ALT level, to less than about 50 IU/ml of serum. Thus, the 

invention includes a method of reducing the serum ALT level of a patient infected with 
HCV exhibiting an initial ALT level greater than 50 IU/ml, comprising administering to 
the patient a composition of the invention in an amount effective to reduce the ALT level 
to less than about 50 IU/ml. 

20 Human Immunodeficiency Virus 

In another aspect the invention provides a method of treating a patient infected with 
Human Immunodeficiency Virus (HIV), such as HIV-1 or HIV-2, or a disease or 
condition associated with HIV infection, such as, for example, ADDS-related Kaposi's 
sarcoma, comprising administering to the patient an effective amount of a composition of 

25 the invention comprising one or more polypeptide or conjugate of the invention, 

optionally in association with other antiviral therapeutic agents as described below. The 
invention also provides a composition for use in treating a patient infected with HIV or a 
disease or condition associated with HIV infection, comprising one or more polypeptide 
or conjugate of the invention and a pharmaceutically acceptable carrier or excipient. A 

30 patient diagnosed as infected with HIV includes a patient exhibiting detectable levels of 

118 



WO 2005/113592 



PCT/US2005/017471 



HIV RNA or proviral DNA in the blood, and/or exhibiting detectable levels of p24 
antigen or anti-HIV antibody in serum. 

A composition comprising a polypeptide of the invention will generally be 
administered at a dose and frequency similar to what is employed in HIV therapeutic 
regimens using interferon-alpha polypeptides such as, e.g. ROFERON®-A (Interferon 
alfa-2a, recombinant; Hoffmann-La Roche Inc.), INTRON® A (Interferon alfa-2b, 
recombinant; Schering Corporation), and DsfFERGEN® (interferon alfacon-1; InterMune, 
Inc.). As was noted above, exemplary recommended dosing schedules of ROFERON or 
INTRON A for the treatment of chronic HCV is 3 million IU (approximately 15 
micrograms (meg)) three times a week by subcutaneous injection for, e.g., 24 to 48 
weeks, and a exemplary recommended dosing schedule of INFERGEN for the treatment 
of chronic HCV is 9 meg three times a week by subcutaneous injection for, e.g., 24 to 48 
weeks. An exemplary recommended dosing schedule of ROFERON for the treatment of 
AIDS-related Kaposi's sarcoma is 36 million units daily for 10 to 12 weeks, then 36 
million units 3 times a week. An exemplary recommended dosing schedule of INTRON 
A for the treatment of AIDS-related Kaposi's sarcoma is 30 million IU/m2 three times a 
week administered subcutaneously. Such dosing schedules provide useful ranges for 
dosage of a polypeptide of the invention for the treatment of HIV or a disease or 
condition associated with HIV infection. Depending on a number of factors (including 
but not limited to the activity and the pharmacokinetics of the polypeptide of the 
invention and the size, age and health of the patient), the polypeptide of the invention 
may be administered in lower amounts and/or less frequently than described above. 

Likewise, a composition comprising a conjugate of the invention will generally be 
administered at a dose and frequency similar to what is employed in HIV therapeutic 
regimens using interferon-alpha conjugates, such as, e.g., PEGASYS® (Peginterferon 
alfa-2a; Hoffmann-La Roche, Inc.) or PEG-INTRON® (peginterferon alfa-2b; Schering 
Corporation). An exemplary dosing schedule of PEG-INTRON for the treatment of HIV 
is between about 1.0 mcg/kg/week and 3.0 mcg/kg/week by subcutaneous injection for, 
e.g., 24 to 48 weeks. Such a dosing schedule provides a useful range for dosage of a 
conjugate of the invention for the treatment of HIV. Depending on a number of factors 
(including but not limited to the molecular weight, activity, and pharmacokinetics of the 
conjugate of the invention and the size, age and health of the patient), the conjugate may 
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be administered in lower amounts (such as, for example, about 0.1, 0.25, 0.50, or 0.75 
mcg/kg/week) and/or less frequently (such as once every 10 days, or once every 2 weeks) 
than described above. 

In some instances the polypeptide or conjugate of the invention is administered in 
5 combination with one or more additional therapeutic agent(s). Current clinical treatments 
of HIV-1 infection in man include multi-drug combination therapies generally termed 
Highly Active Antiretroviral Therapy ("HAART"). The polypeptide or conjugate of the 
invention may thus be administered in combination with HAART or other antiviral 
therapeutic compounds. Typical components of HAART, which involve various 

10 combinations of nucleoside reverse transcriptase inhibitors ("NRTI"), non-nucleoside 
reverse transcriptase inhibitors ("NNRTI") and HIV protease inhibitors ("PI"), are 
described, for example, in A.M. Vandamme et al. (1998) Antiviral Chemistry & 
Chemotherapy, 9: 187-203; "Drugs for HIV Infection" in The Medical Letter Vol. 39 
(Issue 1015) December 5, 1997, pages 1 1 1-116; and published United States Patent 

15 Application US 20020182179 Al; each of which is incorporated by reference herein. If 
the HIV-infected patient is also infected with HCV, the polypeptide or conjugate of the 
invention may be administered in combination with an antiviral drug such as Ribavirin, 
which is sold under the names COPEGUS® (Hoffmann-La Roche, Inc) and REBETOL® 
(Schering Corporation), along with HAART. 

20 The precise amount and frequency of administration of the polypeptide or conjugate 

of the invention, and administration of additional therapeutic agents such as HAART 
and/or Ribavirin, will depend on a number of factors such as the specific activity and the 
pharmacokinetic properties of the polypeptide or the conjugate, as well as the nature of 
the condition being treated (such as, the presence of additional viral infections such as 

25 HCV), among other factors known to those of skill in the art. Normally, the dose should 
be capable of preventing or lessening the severity or spread of the indication being 
treated. Such a dose may be termed an "effective" or "therapeutically effective" amount. 
It will be apparent to those of skill in the art that an effective amount of a polypeptide, 
conjugate or composition of the invention depends, inter alia, upon the condition being 

30 treated, the dose, the administration schedule, whether the polypeptide or conjugate or 
composition is administered alone or in combination with other therapeutic agents, the 
serum half-life and other pharmacokinetic properties of the polypeptide, conjugate or 
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composition, as well as the size, age, and general health of the patient. The dosage and 
frequency of administration is ascertainable by one skilled in the art using known 
techniques. 

In addition to general uses described above, a polypeptide or conjugate of the 
5 invention may be administered to the following subsets of patients infected with HIV: as 
an adjuvant therapy, for example to HA ART as described above; as monotherapy or 
combination therapy in early stage patients when the viral load is generally high; as a 
combined anti- viral and immunodulatory agent for patients undergoing structured 
treatment interruptions (STI) or "drug holidays"; as salvage therapy in patients whose 
10 HAART options are limited; as an antiviral method of treatment to keep viral load in 
check without initiating HAART therapy in order to delay the appearance of HAART 
resistant virus. 

The effectiveness of treatment may be determined by measuring viral load, for 
example by determining the titer or level of virus in serum or plasma using methods 

15 known in the art, such as, e.g., by monitoring HIV-1 viral RNA levels using quantitative 
RT-PCR based tests, such as the AMPLICOR HIV-1 MONITOR® Test, vl.5 (Roche 
Diagnostics). In some instances, an effective amount of a composition of the invention is 
one that is sufficient to achieve a reduction in viral load by at least 0.5 log units, at least 1 
log unit, at least 2 log units, at least 3 log units, at least 4 log units, at least 5 log units, at 

20 least 6 log units or at least 7 log units over the course of treatment, compared to the viral 
load prior to treatment. In some instances an effective amount of a composition of the 
invention is an amount that is sufficient to reduce viral load to levels which are essentially 
undetectable, such as, for example, less than about 50-100 copies HIV-1 RNA per ml 
serum. The invention includes a method of reducing the level of HIV RNA in serum of a 

25 patient infected with HIV, comprising administering to the patient a composition of the 
invention in an amount effective to reduce the level of HIV RNA compared to the HIV 
RNA level present prior to the start of treatment. 

The effectiveness of treatment may alternatively or in addition be determined 
by a serum markers for HIV replication, such as the presence of HIV p24 antigen in the 

30 blood. In some instances an effective amount of a composition of the invention is an 
-amount that is sufficient to reduce the level of p24 antigen in the blood to 50%, 25%, 
10% or 5% of the level present prior to the start of treatment. In some instances an 
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effective amount of a composition of the invention is an amount that is sufficient to 
reduce the level of p24 antigen to a level which is essentially undetectable. The invention 
includes a method of reducing the level of p24 antigen in serum of a patient infected with 
HIV, comprising administering to the patient a composition of the invention in an amount 
5 effective to reduce the level of p24 antigen compared to the p24 antigen level present 
prior to the start of treatment. 

Hepatitis B Virus 

In another aspect, the invention provides a method of treating a patient infected with 
Hepatitis B Virus (HBV), comprising administering to the patient an effective amount of 

10 a composition of the invention comprising one or more polypeptide or conjugate of the 
invention. The invention also provides a composition for use in treating a patient infected 
with HBV, comprising one or more polypeptide or conjugate of the invention and a 
pharmaceutically acceptable carrier or excipient. 

A patient diagnosed as infected with HBV exhibits detectable hepatitis B surface 

15 antigen (HBsAg) in the serum. Chronic HBV infection is further categorized as either 
"replicative" or "non-replicative". In replicative infection, the patient usually has a 
relatively high serum concentration of viral DNA and detectable HBeAg, which is an 
alternatively processed protein of the HBV pre-core gene that is synthesized under 
conditions of high viral replication. However, in rare strains of HBV with mutations in 

20 the pre-core gene, replicative infection can occur in the absence of detectable serum 
HBeAg. Patients with chronic hepatitis B and replicative infection have a generally 
worse prognosis and a greater chance of developing cirrhosis and/or hepatocellular 
carcinoma than those without HBeAg. In non-replicative infection, the rate of viral 
replication in the liver is low, serum HBV DNA concentration is generally low and 

25 hepatitis Be antigen (HBeAg) is not detected. 

A composition comprising a polypeptide of the invention will generally be 
administered at a dose and frequency similar to what is employed in HBV therapeutic 
regimens using clinically-approved interferon-alpha polypeptides, such as, e.g. 
INTRON® A (Interferon alfa-2b, recombinant; Schering Corporation). An exemplary 

30 recommended dosing schedule of INTRON A for the treatment of chronic HBV in adults 
is 30 to 35 million IU per week by subcutaneous or intramuscular injection, either as 5 
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million IU per day (qd) or as 10 million IU three times per week (tiw) for 16 weeks. 
Depending on a number of factors (including, but not limited to, the activity and the 
pharmacokinetics of the polypeptide of the invention, and the size and health of the 
patient), the polypeptide of the invention may be administered in lower amounts (such as, 

5 for example, about 5, 10, 15, 20, or 25 million IU per week) and/or less frequently (such 
as once per week or twice per week) than described above. 

Likewise, a composition comprising a conjugate of the invention will generally be 
administered at a dose and frequency similar to what is employed in HBV therapeutic 
regimens using interferon-alpha conjugates currently undergoing clinical trials, such as, 

10 e.g., PEGASYS® (Peginterferon alfa-2a; Hoffmann-La Roche, Inc.). Exemplary dosing 
schedules of PEGASYS for the treatment of chronic HBV is between 90 mcg-270 meg 
injected once per week for a total of 24 weeks. Depending on a number of factors 
(including but not limited to the molecular weight, activity, and pharmacokinetics of the 
conjugate of the invention and the size and health of the patient), the conjugate may be 

15 administered in lower amounts (such as, for example, about 25, 50, 75, 100, 125, 150, or 
200 meg) and/or less frequently (such as once every 10 days, or once every 2 weeks) than 
described above. 

In some instances the polypeptide or conjugate of the invention is administered in 
combination with one or more additional therapeutic agent(s). For example, the 

20 polypeptide or conjugate of the invention may be administered in combination with 
antiviral drugs such as lamivudine (also known as 3TC), which is sold under the name 
Epivir-HB V® (GlaxoSmithKline), or adefovir dipivoxil, which is sold under the name 
Hepsera® (Gilead Sciences). 

The precise amount and frequency of administration of the polypeptide or conjugate 

25 of the invention will depend on a number of factors such as the specific activity and the 
pharmacokinetic properties of the polypeptide or the conjugate, as well as the nature of 
the condition being treated (such as, e.g., in the case of chronic HBV infection, whether 
the infection is replicative or non-replicative), among other factors known to those of skill 
in the art. Normally, the dose should be capable of preventing or lessening the severity 

30 or spread of the indication being treated. Such a dose may be termed an "effective" or 
"therapeutically effective" amount. It will be apparent to those of skill in the art that an 
effective amount of a polypeptide, conjugate or composition of the invention depends, 
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inter alia, upon the condition being treated, the dose, the administration schedule, 
whether the polypeptide or conjugate or composition is administered alone or in 
combination with other therapeutic agents, the serum half-life and other pharmacokinetic 
properties of the polypeptide, conjugate or composition, as well as the size, age, and 
5 general health of the patient. The dosage and frequency of administration is ascertainable 
by one skilled in the art using known techniques. 

The effectiveness of treatment may be determined for example by measuring the 
viral load, e.g. the level of viral DNA in serum or plasma, using methods known in the 
art. Methods for monitoring HBV DNA levels include quantitative PCR-based tests, such 

1 0 as the COB AS AMPLICOR HBV MONITOR® Test, v2.0 or the AMPLICOR HBV 
MONITOR® Test, v2.0 (both from Roche Diagnostics). In some instances an effective 
amount of a composition of the invention is an amount that is sufficient to reduce viral 
DNA to, e.g., less than about 500,000 copies/ml serum or less than about 100,000 
copies/ml serum or less than about 10,000 copies/ml serum, or to levels which are 

15 essentially undetectable (such as, for example, less than about 1000 copies/ml serum, less 
than about 500 copies/ml serum, or less than about 200 copies/ml serum). The invention 
includes a method of reducing the level of HBV DNA in serum of a patient infected with 
HBV, comprising administering to the patient a composition of the invention in an 
amount effective to reduce the level of HBV DNA compared to the HBV DNA level 

20 present prior to the start of treatment. 

The effectiveness of treatment may alternatively or in addition be determined by 
measuring other serum markers for HBV replication, such as HBeAg. In some instances 
an effective amount of a composition of the invention is an amount that is sufficient to 
reduce the level of HBeAg in serum to 50%, 25%, 10% or 5% of the level present prior 

25 to the start of treatment. In some instances an effective amount of a composition of the 
invention is an amount that is sufficient to reduce the level of HBeAg to a level which is 
essentially undetectable. The invention includes a method of reducing the level of HBeAg 
in serum of a patient infected with HBV, comprising administering to the patient a 
composition of the invention in an amount effective to reduce the level of HBeAg 

30 compared to the HBeAg level present prior to the start of treatment. 

As discussed above, another serum marker indicative of HBV infection is HBsAg. 
Thus, the effectiveness of treatment may alternatively or in addition be determined by 
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measuring the level of HBsAg in the serum. In some instances an effective amount of a 
composition of the invention is an amount that is sufficient to reduce the level of HBsAg 
in serum to 50%, 25%, 10% or 5% of the level present prior to the start of treatment. In 
some instances an effective amount of a composition of the invention is an amount that is 
5 sufficient to reduce level of HBsAg to a level which is essentially undetectable. The 

invention includes a method of reducing the level of HBsAg in serum of a patient infected 
with HBV, comprising administering to the patient a composition of the invention in an 
amount effective to reduce the level of HBsAg compared to the HBsAg level present 
prior to the start of treatment. 

10 The effectiveness of treatment may alternatively or in addition be determined by 

measuring a parameter indicative of a condition associated with HBV infection, such as, 
e.g., liver damage. For example, the level of serum alanine aminotransferase (ALT) may 
be measured using a standard assay. In general, an ALT level of less than about 50 
international units/ml (IU/ml) serum is considered normal. A higher ALT level may be 

15 indicative of ongoing liver damage. In some instances, an effective amount of a 

composition of the invention is an amount effective to reduce ALT level, in a patient with 
a higher than normal ALT level, to less than about 50 IU/ml of serum. Thus, the 
invention includes a method of reducing the serum ALT level of a patient infected with 
HBV exhibiting an initial ALT level greater than 50 IU/ml, comprising administering to 

20 the patient a composition of the invention in an amount effective to reduce the ALT level 
to less than about 50 IU/ml. 

Human T-Lymphotropic Virus type 1 

In another aspect the invention provides a method of treating a patient infected with 
Human T-Lymphotropic Virus type 1 (HTLV-1), or a disease or condition associated 

25 with HTLV-1 infection, such as, for example, adult T-cell leukemia/1 ymphoma (ATLL), 
HTLV-1 -associated myelopathy (HAM), Tropical Spastic Paraparesis (TSP), uveitis, or 
arthropathy. The method comprises administering to the patient an effective amount of a 
composition of the invention comprising one or more polypeptide or conjugate of the 
invention. The invention also provides a composition for use in treating a patient infected 

30 with HTLV-1, or a disease or condition associated with HTLV-1 infection, the 

composition comprising one or more polypeptide or conjugate of the invention and a 
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pharmaceutically acceptable carrier or excipient. A patient diagnosed with HTLV-1 
infection includes a patient exhibiting HTLV-1 pro viral DNA in the blood and/or 
antibody to an HTLV-1 antigen in the serum. 

A composition comprising a polypeptide of the invention will generally be 
5 administered at a dose and frequency similar to what is employed in HCV or oncology 
therapeutic regimens using clinically-approved interferon-alpha polypeptides, such as, 
e.g. ROFERON®-A (Interferon alfa-2a, recombinant; Hoffmann-La Roche Inc.) and 
INTRON® A (Interferon alfa-2b, recombinant; Schering Corporation). Exemplary 
recommended dosing schedules of ROFERON or INTRON A for the treatment of chronic 

10 HCV is 3 million IU (approximately 15 micrograms (meg)) three times a week by 

subcutaneous injection for, e.g., 24 to 48 weeks. An exemplary recommended dosing 
schedule of ROFERON for the treatment of hairy-cell leukemia is 3-5 million units daily 
by subcutaneous injection for 16 to 24 weeks, then 3 million units 3 times a week for 
maintenance. An exemplary recommended dosing schedule of INTRON A for the 

15 treatment of hairy-cell leukemia is 2 million IU/m2 (square meter of body surface ) 

administered subcutaneously 3 times a week for 6 months. Such dosing schedules provide 
useful ranges for dosage of a polypeptide of the invention for the treatment of HTLV-1 
infection, or a disease or condition associated with HTLV-1 infection such as adult T-cell 
leukemia/lymphoma (ATLL), HTLV-1 -associated myelopathy (HAM), or Tropical 

20 Spastic Paraparesis (TSP). Depending on a number of factors (including but not limited 
to the activity and the pharmacokinetics of the polypeptide of the invention and the size, 
age and health of the patient), the polypeptide may be administered in lower amounts 
and/or less frequently than described above. 

Likewise, a composition comprising a conjugate of the invention will generally be 

25 administered at a dose and frequency similar to what is employed in HCV therapeutic or 
oncology therapeutic regimens using clinically-approved interferon-alpha conjugates, 
such as, e.g., PEGASYS® (Peginterferon alfa-2a; Hoffmann-La Roche, Inc.) or PEG- 
INTRON® (peginterferon alfa-2b; Schering Corporation). An exemplary recommended 
dosing schedule of PEGASYS for the treatment of chronic HCV is 180 meg once weekly 

30 by subcutaneous injection for, e.g., 24 to 48 weeks. An exemplary recommended dosing 
schedule of PEG-ESfTRON for the treatment of chronic myelogenous leukemia is 6 meg 
/kg body weight once weekly by subcutaneous injection for, e.g., 52 weeks. Such dosing 
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schedules provide useful ranges for dosage of a conjugate of the invention for the 
treatment of HTLV-1 infection, or a disease or condition associated with HTLV-1 
infection such as adult T-cell leukemia/lymphoma (ATLL), HTLV-1 -associated 
myelopathy (HAM), or Tropical Spastic Paraparesis (TSP). Depending on a number of 

5 factors (including but not limited to the molecular weight, activity, and pharmacokinetics 
of the conjugate of the invention and the size, age and health of the patient), the conjugate 
may be administered in lower amounts and/or less frequently than described above. 

In some instances the polypeptide or conjugate of the invention is administered in 
combination with one or more additional therapeutic agent(s). For example, the 

10 polypeptide or conjugate of the invention may be administered in combination with an 
antiretro viral drug such as zidovudine (AZT) and/or lamivudine (3TC). It may also be 
administered in combination with peripheral blood stem cell transplantation, conventional 
chemotherapy, or high dose chemotherapy with autologous or allogeneic bone marrow 
transplantation. Alternatively, the polypeptide or conjugate of the invention may be 

1 5 combined with other immunotherapy, for example with anti-interleukin-2 receptor 

monoclonal antibodies or injection of cytotoxic T-cells directed against virus antigens. 

The precise amount and frequency of administration of the polypeptide or conjugate 
of the invention will depend on a number of factors such as the specific activity and the 
pharmacokinetic properties of the polypeptide or the conjugate, as well as the nature of 

20 the condition being treated, among other factors known to those of skill in the art. 

Normally, the dose should be capable of preventing or lessening the severity or spread of 
the indication being treated. Such a dose may be termed an "effective" or 
"therapeutically effective" amount. It will be apparent to those of skill in the art that an 
effective amount of a polypeptide, conjugate or composition of the invention depends, 

25 inter alia, upon the condition being treated, the dose, the administration schedule, 
whether the polypeptide or conjugate or composition is administered alone or in 
combination with other therapeutic agents, the serum half-life and other pharmacokinetic 
properties of the polypeptide, conjugate or composition, as well as the size, age, and 
general health of the patient. The dosage and frequency of administration is ascertainable 

30 by one skilled in the art using known techniques. 

The effectiveness of treatment may be determined by measuring the HTLV-1 viral 
load, such as, for example, measuring the level of HTLV-1 proviral DNA in the blood 
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using methods known in the art, for example by quantitative PCR as described by Saito et 
al, (2004) J. Infect Dis. 189(l):29-40. In some instances, an effective amount of a 
composition of the invention is one that is sufficient to achieve a reduction in viral load 
by at least 0.5 log unit, such as at least 1 log unit, at least 2 log units, at least 3 log units, 
5 at least 4 log units, at least 5 log units, at least 6 log units, or at least 7 log units over the 
course of treatment, compared to the viral load prior to treatment. In some instances an 
effective amount of a composition of the invention is an amount that is sufficient to 
reduce viral load to levels which are essentially undetectable. The invention includes a 
method of reducing the level of HTLV-1 pro viral DNA in blood of a patient infected with 

10 HTLV-1, comprising administering to the patient a composition of the invention in an 
amount effective to reduce the level of HTLV-1 proviral DNA compared to that present 
prior to the start of treatment. 

The effectiveness of treatment may alternatively or in addition be determined by 
measuring titer of an anti-HTLV-1 antibody in the serum, using methods known in the 

15 art, such as, for example, by commercially-available tests such as INNO-LIA™ HTLV 
Vll (Innogenetics; Gent Belgium) and Abbott HTLV-I/HTLV-II E1A (Abbott 
Laboratories; Abbott Park, EL). In some instances an effective amount of a composition 
of the invention is an amount that is sufficient to reduce the titer of an anti-HTLV-1 
antibody in the serum to 50%, 25%, 10% or 5% of the titer present prior to the start of 

20 treatment. In some instances an effective amount of a composition of the invention is an 
amount that is sufficient to reduce the titer of an anti-HTLV-1 antibody in the serum to a 
level which is essentially undetectable. The invention includes a method of reducing the 
titer of an anti-HTLV-1 antibody in the serum of a patient infected with HTLV-1, 
comprising administering to the patient a composition of the invention in an amount 

25 effective to reduce the titer of the anti-HTLV-1 antibody in the serum compared to that 
present prior to the start of treatment. 

Human Papillomavirus 

In another aspect the invention provides a method of treating a patient infected with 
Human Papillomavirus (HPV), or a disease or condition associated with HPV infection, 
30 such as, for example, warts of the hands and feet, and lesions of the mucous membranes 
of the oral, anal and genital cavities. While many types of HPV are relatively harmless, 
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many other types are spread through sexual contact and give rise to genital or venereal 
warts (termed condylomata acuminata) which may give rise to cervical cancer and other 
genital cancers. The method comprises administering to the patient infected with HPV 
an effective amount of a composition of the invention comprising one or more 

5 polypeptide or conjugate of the invention. The invention also provides a composition for 
use in treating a patient infected with HPV, or a disease or condition associated with HPV 
infection, the composition comprising one or more polypeptide or conjugate of the 
invention and a pharmaceutical^ acceptable carrier or excipient. A patient diagnosed 
with HPV infection includes a patient exhibiting HPV viral DNA in biopsied genital 

10 tissue, and sometimes (but not always) exhibiting visible lesions on genital tissues. 
A composition comprising a polypeptide of the invention will generally be 
administered at a dose and frequency similar to what is employed in HPV therapeutic 
regimens using clinically-approved interferon-alpha polypeptides, such as, for example, 
LMTRON® A (Interferon alfa-2b, recombinant; Schering Corporation). A recommended 

15 dose of INTRON A for the treatment of condylomata acuminata is 1.0 million IU 

injected into each lesion, for up to 5 lesions, using a tuberculin or similar syringe and a 
25-to 30-gauge needle, three times per week on alternate days, for 3 weeks. Patients with 
6 to 10 condylomata may receive a second (sequential) course of treatment at the above 
dosage schedule, to treat up to five additional condylomata per course of treatment. 

20 Patients with greater than 10 condylomata may receive additional sequences depending 
on how large a number of condylomata are present. The interferon may alternatively or 
in addition be applied topically, e.g. in a cream or ointment form (as described for 
example in Stentella et al (1996) Clin. Exp. Obstet. Gynecol. 23(l):29-36). Such dosing 
schedules provide useful ranges for dosage of a polypeptide of the invention for the 

25 treatment of HPV infection, or a disease or condition associated with HPV infection such 
as condylomata acuminata. Depending on a number of factors (including but not limited 
to the activity and the pharmacokinetics of the polypeptide of the invention and the size, 
age and health of the patient), the polypeptide of the invention may be administered in 
lower amounts and/or less frequently than described above. Likewise, a composition 

30 comprising a conjugate of the invention will generally be administered, e.g. 

intralesionally or topically, at a dose effective to reduce the amount of HPV viral DNA in 
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the effected tissues or to reduce the size/or number of genital lesions in the infected 
individual. 

In some instances the polypeptide or conjugate of the invention is administered in 
combination with one or more additional therapeutic agent(s). For example, the 
5 polypeptide or conjugate of the invention may be administered in combination with an 
anti-HPV therapeutic such as Podofilox (Condylox) and/or Podophyllin (Pododerm, 
Podocon-25). 

The precise amount and frequency of administration of the polypeptide or conjugate 
of the invention will depend on a number of factors such as the specific activity and the 

10 pharmacokinetic properties of the polypeptide or the conjugate, as well as the nature of 
the condition being treated, among other factors known to those of skill in the art. 
Normally, the dose should be capable of preventing or lessening the severity or spread of 
the indication being treated. Such a dose may be termed an "effective" or 
"therapeutically effective" amount. It will be apparent to those of skill in the art that an 

15 effective amount of a polypeptide, conjugate or composition of the invention depends, 
inter alia, upon the condition being treated, the dose, the administration schedule, 
whether the polypeptide or conjugate or composition is administered alone or in 
combination with other therapeutic agents, the serum half-life and other pharmacokinetic 
properties of the polypeptide, conjugate or composition, as well as the size, age, and 

20 general health of the patient. The dosage and frequency of administration is ascertainable 
by one skilled in the art using known techniques. 

The effectiveness of treatment may be determined by measuring the HPV viral load, 
such as, for example, measuring the level of HPV viral DNA in biopsied tissue. In some 
instances, an effective amount of a composition of the invention is one that is sufficient to 

25 achieve a reduction in viral load by at least 0.5 log unit, such as at least 1 log unit, at least 
2 log units, at least 3 log units, at least 4 log units, at least 5 log units, at least 6 log units, 
or at least 7 log units over the course of treatment, compared to the viral load prior to 
treatment. In some instances an effective amount of a composition of the invention is an 
amount that is sufficient to reduce viral load to levels which are essentially undetectable. 

30 The invention includes a method of reducing the level of HPV viral DNA in tissue of a 
patient infected with HPV, comprising administering to the patient a composition of the 
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invention in an amount effective to reduce the level of HPV viral DNA compared to that 

present prior to the start of treatment. 

The effectiveness of treatment may alternatively or in addition be determined by 

observing the size or number of genital lesions (condylomata) in the infected individual. 
5 In some instances an effective amount of a composition of the invention is an amount that 

is sufficient to reduce the size and/or number of condylomata in the infected individual. 

The invention includes a method of reducing reduce the size and/or number of 

condylomata in a patient infected with HPV, comprising administering to the patient a 

composition of the invention in an amount effective to reduce the size and/or number of 
10 condylomata in the patient compared to those present prior to the start of treatment. 

FORMULATIONS AND ROUTES OF ADMINISTRATION 

Therapeutic formulations of the polypeptide or conjugate of the invention are 
typically administered in a composition that includes one or more pharmaceutically 
15 acceptable carriers or excipients. Such pharmaceutical compositions may be prepared in a 
manner known per se in the art to result in a polypeptide pharmaceutical that is 
sufficiently storage-stable and is suitable for administration to humans or animals. 

Drug form 

The polypeptide or conjugate of the invention can be used "as is" and/or in a salt 
20 form thereof. Suitable salts include, but are not limited to, salts with alkali metals or 

alkaline earth metals, such as sodium, potassium, calcium and magnesium, as well as e.g. 
zinc salts. These salts or complexes may by present as a crystalline and/or amorphous 
structure. 

Excipients 

25 "Pharmaceutically acceptable" means a carrier or excipient that at the dosages and 

concentrations employed does not cause any untoward effects in the patients to whom it is 
administered. Such pharmaceutically acceptable carriers and excipients are well known in 
the art (see Remington's Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack 
Publishing Company (1990); Pharmaceutical Formulation Development of Peptides and 
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Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis (2000) ; and Handbook of 
Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., Pharmaceutical Press (2000)). 

Mix of drugs 

The composition of the invention may be administered alone or in conjunction with 
other therapeutic agents. Ribavirin, for example, is often co-administered with IFN-alpha 
and has been shown to increase efficacy in antiviral treatments, such as HCV treatment. 
A variety of small molecules are being developed against both viral targets (viral 
proteases, viral polymerase, assembly of viral replication complexes) and host targets 
(host proteases required for viral processing, host kinases required for phosphorylation of 
viral targets such as NS5A and inhibitors of host factors required to efficiently utilize the 
viral IRES). Other cytokines may be co-administered, such as for example IL-2, IL-12, 
IL-23, IL-27, or IFN-gamma. These agents may be incorporated as part of the same 
pharmaceutical composition or may be administered separately from the polypeptide or 
conjugate of the invention, either concurrently or in accordance with another treatment 
schedule. In addition, the polypeptide, conjugate or composition of the invention may be 
used as an adjuvant to other therapies. 

Patients 

A "patient" for the purposes of the present invention includes both humans and other 
mammals. Thus the methods are applicable to both human therapy and veterinary 
applications. 

Types of composition and administration route 

The pharmaceutical composition comprising the polypeptide or conjugate of the 
invention may be formulated in a variety of forms, e.g. as a liquid, gel, lyophilized, or as 
a compressed solid. The preferred form will depend upon the particular indication being 
treated and will be apparent to one skilled in the art. 

The administration of the formulations of the present invention can be performed in 
a variety of ways, including, but not limited to, orally, subcutaneously, intravenously, 
intracerebrally, intranasally, transdermally, intraperitoneally, intramuscularly, 
intrapulmonary, vaginally, rectally, intraocularly, or in any other acceptable manner. The 
formulations can be administered continuously by infusion, although bolus injection is 
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acceptable, using techniques well known in the art, such as pumps (e.g., subcutaneous 
osmotic pumps) or implantation. In some instances the formulations may be directly 
applied as a solution or spray. 

Parenterals 

An example of a pharmaceutical composition is a solution designed for parenteral 
administration. Although in many cases pharmaceutical solution formulations are 
provided in liquid form, appropriate for immediate use, such parenteral formulations may 
also be provided in frozen or in lyophilized form. In the former case, the composition 
must be thawed prior to use. The latter form is often used to enhance the stability of the 
active compound contained in the composition under a wider variety of storage 
conditions, as it is recognized by those skilled in the art that lyophilized preparations are 
generally more stable than their liquid counterparts. Such lyophilized preparations are 
reconstituted prior to use by the addition of one or more suitable pharmaceutically 
acceptable diluents such as sterile water for injection or sterile physiological saline 
solution. 

Parenterals may be prepared for storage as lyophilized formulations or aqueous 
solutions by mixing, as appropriate, the polypeptide having the desired degree of purity 
with one or more pharmaceutically acceptable carriers, excipients or stabilizers typically 
employed in the art (all of which are termed "excipients"), for example buffering agents, 
stabilizing agents, preservatives, isotonifiers, non-ionic detergents, antioxidants and/or 
other miscellaneous additives. 

Buffering agents help to maintain the pH in the range which approximates 
physiological conditions. They are typically present at a concentration ranging from about 
2 mM to about 50 mM. Suitable buffering agents for use with the present invention 
include both organic and inorganic acids and salts thereof such as citrate buffers (e.g., 
monosodium citrate-disodium citrate mixture, citric acid-trisodium citrate mixture, citric 
acid-monosodium citrate mixture, etc.), succinate buffers (e.g., succinic acid- 
monosodium succinate mixture, succinic acid-sodium hydroxide mixture, succinic acid- 
disodium succinate mixture, etc.), tartrate buffers (e.g., tartaric acid-sodium tartrate 
mixture, tartaric acid-potassium tartrate mixture, tartaric acid-sodium hydroxide mixture, 
etc.), fumarate buffers (e.g., fumaric acid-monosodium fumarate mixture, fumaric acid- 
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disodium fumarate mixture, monosodium fumarate-disodium fumarate mixture, etc.), 
gluconate buffers (e.g., gluconic acid-sodium glyconate mixture, gluconic acid-sodium 
hydroxide mixture, gluconic acid-potassium glyuconate mixture, etc.), oxalate buffer 
(e.g., oxalic acid-sodium oxalate mixture, oxalic acid-sodium hydroxide mixture, oxalic 

5 acid-potassium oxalate mixture, etc.), lactate buffers (e.g., lactic acid-sodium lactate 

mixture, lactic acid-sodium hydroxide mixture, lactic acid-potassium lactate mixture, etc.) 
and acetate buffers (e.g., acetic acid-sodium acetate mixture, acetic acid-sodium 
hydroxide mixture, etc.). Additional possibilities are phosphate buffers, histidine buffers 
and trimethylamine salts such as Tris. 

10 Preservatives are added to retard microbial growth, and are typically added in 

amounts of about 0.2%- 1% (w/v). Suitable preservatives for use with the present 
invention include phenol, benzyl alcohol, meta-cresol, methyl paraben, propyl paraben, 
octadecyldimethylbenzyl ammonium chloride, benzalkonium halides (e.g. benzalkonium 
chloride, bromide or iodide), hexamethonium chloride, alkyl parabens such as methyl or 

15 propyl paraben, catechol, resorcinol, cyclohexanol and 3-pentanol. 

Isotonicifiers are added to ensure isotonicity of liquid compositions and include 
polyhydric sugar alcohols, preferably trihydric or higher sugar alcohols, such as glycerin, 
erythritol, arabitol, xylitol, sorbitol and mannitoL Polyhydric alcohols can be present in an 
amount between 0.1% and 25% by weight, typically 1% to 5%, taking into account the 

20 relative amounts of the other ingredients. 

Stabilizers refer to a broad category of excipients which can range in function from a 
bulking agent to an additive which solubilizes the therapeutic agent or helps to prevent 
denaturation or adherence to the container wall. Typical stabilizers can be polyhydric 
sugar alcohols (enumerated above); amino acids such as arginine, lysine, glycine, 

25 glutamine, asparagine, histidine, alanine, ornithine, L-leucine, 2-phenylalanine, glutamic 
acid, threonine, etc., organic sugars or sugar alcohols, such as lactose, trehalose, 
stachyose, mannitol, sorbitol, xylitol, ribitol, myoinisitol, galactitol, glycerol and the like, 
including cyclitols such as inositol; polyethylene glycol; amino acid polymers; sulfur- 
containing reducing agents, such as urea, glutathione, thioctic acid, sodium thioglycolate, 

30 thioglycerol, a-monothioglycerol and sodium thiosulfate; low molecular weight 

polypeptides (i.e. <10 residues); proteins such as human serum albumin, bovine serum 
albumin, gelatin or immunoglobulins; hydrophilic polymers such as 
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polyvinylpyrrolidone; monosaccharides such as xylose, mannose, fructose and glucose; 
disaccharides such as lactose, maltose and sucrose; trisaccharides such as raffinose, and 
polysaccharides such as dextran. Stabilizers are typically present in the range of from 0. 1 
to 10,000 parts by weight based on the active protein weight. 

Non-ionic surfactants or detergents (also known as "wetting agents") may be present 
to help solubilize the therapeutic agent as well as to protect the therapeutic polypeptide 
against agitation-induced aggregation, which also permits the formulation to be exposed 
to shear surface stress without causing denaturation of the polypeptide. Suitable non-ionic 
surfactants include polysorbates (20, 80, etc.), polyoxamers (184, 188 etc.), Pluronic® 
polyols, polyoxyethylene sorbitan monoethers (Tween®-20, Tween®-80, etc.). 

Additional miscellaneous excipients include bulking agents or fillers (e.g. starch), 
chelating agents (e.g. EDTA), antioxidants (e.g., ascorbic acid, methionine, vitamin E) 
and cosolvents. 

The active ingredient may also be entrapped in microcapsules prepared, for example, 
by coascervation techniques or by interfacial polymerization, for example 
hydroxymethylcellulose, gelatin or poly-(methylmethacylate) microcapsules, in colloidal 
drug delivery systems (for example liposomes, albumin microspheres, microemulsions, 
nano-particles and nanocapsules) or in macroemulsions. Such techniques are disclosed in 
Remington's Pharmaceutical Sciences, supra. 

In one aspect of the invention the composition is a liquid composition, such as an 
aqueous composition, and comprises a sulfoalkyl ether cyclodextrin derivative of the 
formula 




wherein 
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n is 4, 5 or 6; Ri, R 2 , R3, R4, Rs> Re, R7, Rs, and R 9 are each, independently, -O- or a -O- 
(C 2 -C6 alkyl)-S03- group, wherein at least one of Ri, R 2 or R3 is independently a -0-(C 2 - 
C 6 alkyl)-S0 3 - group; and Si, S 2 , S3, S 4 > S 5 , S 6 , S 7 , S 8 , and S 9 are each, independently, a 
pharmaceutically acceptable cation, including H + . 

It should be noted that when n=4, the sulfoalkyl ether cyclodextrin may also be 
referred to as a oc-sulfoalkyl ether cyclodextrin. In a similar way, when n=5, the term (3- 
sulfoalkyl ether cyclodextrin may be employed and when n=6, the sulfoalkyl ether 
cyclodextrin may also be referred to as a y-sulfoalkyl ether cyclodextrin. 

In a further embodiment, n is 5 or 6. In a preferred embodiment n=6. 

In a still further embodiment R], R 2 or R 3 is independently selected from the 
group consisting of -OCH 2 CH 2 CH 2 S0 3 -, -OCH 2 CH 2 CH 2 CH 2 S0 3 - and 
-OCH 2 CH 2 CH 2 CH 2 CH 2 S0 3 -. Most preferably, Ri, R 2 or R 3 is independently 
-OCH 2 CH 2 CH 2 CH 2 S0 3 -. 

In a further embodiment Si, S 2 , S 3 , S4, S5, Se, S7, Sg, and S9 are each, 
independently, a pharmaceutically acceptable cation selected from H + , alkali metals (e.g. 
Li + , Na + , K + ), alkaline earth metals (e.g., Ca +2 , Mg +2 ), ammonium ions and amine cations 
such as the cations of (Ci-Ce) alkylamines, piperidine, pyrazine, (Ci-C 6 ) alkanolamine 
and (C 4 -C 8 )cycloalkanolamine. Most preferably, Si, S 2 , S 3 , S4, S 5 , Se, S 7 , Sg, and S9 are 
each, independently, a pharmaceutically acceptable cation selected from the group 
consisting of H + , Li + , Na + , K + , in particular Na + . 

The sulfoalkyl ether cyclodextrin may contain from 1 to 18 sulfoalkyl groups 
(when n=4), from 1-21 sulfoalkyl groups (when n=5) or from 1-21 (when n=6). In a 
preferred embodiment of the invention n=5 and the sulfoalkylether derivative comprises, 
on average, 2-20 sulfoalkyl groups (in particular sulfobutyl groups), such as 3-10 
sulfoalkyl groups (in particular sulfobutyl groups), more preferably 4-9 sulfoalkyl groups 
(in particular sulfobutyl groups), even more preferably 5-9 sulfoalkyl groups (in particular 
sulfobutyl groups), such as 6-8 sulfoalkyl groups (in particular sulfobutyl groups), e.g. 7 
sulfoalkyl groups (in particular sulfobutyl groups). 

In some instances the sulfoalkyl ether cyclodextrin derivative is a salt, in 
particular a sodium salt, of P-cyclodextrin sulfobutyl ether (i.e. n=5), which on average 
contains 7 sulfobutyl groups. This sulfoalkyl ether cyclodextrin derivative is also termed 
SBE7-p-CD and is available as Captisol® (Cydex, Overland Park, Kansas). 
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The term "Ci-Ce alkyl" represents a branched or straight alkyl group having from 
one to six carbon atoms. Typical Ci-Ce alkyl groups include, but are not limited to, 
methyl ethyl, n-propyl, iso-propyl, n-butyl, iso-butyl, sec-butyl, tert-butyl, n-pentyl, iso- 
pentyl, n-hexyl and iso-hexyl. 

The term "C2-C6 alkyl" represents a branched or straight alkyl group having from 
two to six carbon atoms. Typical C2-C6 alkyl groups include, but are not limited to, ethyl, 
n-propyl, iso-propyl, n-butyl, iso-butyl, sec-butyl, tert-butyl, n-pentyl, iso-pentyl, n-hexyl 
and iso-hexyl. 

Further details concerning compositions comprising the polypeptides disclosed 
herein and sulfoalkyl ether cyclodextrin derivatives can be found in WO 03/002152, 
particularly the section entitled 'The sulfoalkyl ether cyclodextrin derivative" on pp. 37- 
49, incorporated herein by reference. 

Parenteral formulations to be used for in vivo administration must be sterile. This is 
readily accomplished, for example, by filtration through sterile filtration membranes. 

Sustained release preparations 

Suitable examples of sustained-release preparations include semi-permeable matrices 
of solid hydrophobic polymers containing the polypeptide or conjugate, the matrices 
having a suitable form such as a film or microcapsules. Examples of sustained-release 
matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate) 
or poly(vinylalcohol)), polylactides, copolymers of L-glutamic acid and ethyl-L- 
glutamate, non-degradable ethylene- vinyl acetate, degradable lactic acid-glycolic acid 
copolymers such as the ProLease® technology or Lupron Depot® (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), 
and poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and 
lactic acid-glycolic acid enable release of molecules for long periods such as up to or over 
100 days, certain hydrogels release proteins for shorter time periods. When encapsulated 
polypeptides remain in the body for a long time, they may denature or aggregate as a 
result of exposure to moisture at 37°C, resulting in a loss of biological activity and 
possible changes in immunogenicity. Rational strategies can be devised for stabilization 
depending on the mechanism involved. For example, if the aggregation mechanism is 
discovered to be intermolecular S-S bond formation through thio-disulfide interchange, 
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stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic 
solutions, controlling moisture content, using appropriate additives, and developing 
specific polymer matrix compositions. 

Oral administration 

5 For oral administration, the pharmaceutical composition may be in solid or liquid 

form, e.g. in the form of a capsule, tablet, suspension, emulsion or solution. The 
pharmaceutical composition is preferably made in the form of a dosage unit containing a 
given amount of the active ingredient. A suitable daily dose for a human or other mammal 
may vary widely depending on the condition of the patient and other factors, but can be 

10 determined by persons skilled in the art using routine methods. 

Solid dosage forms for oral administration may include capsules, tablets, 
suppositories, powders and granules. In such solid dosage forms, the active compound 
may be admixed with at least one inert diluent such as sucrose, lactose, or starch. Such 
dosage forms may also comprise, as is normal practice, additional substances, e.g. 

15 lubricating agents such as magnesium stearate. In the case of capsules, tablets and pills, 
the dosage forms may also comprise buffering agents. Tablets and pills can additionally 
be prepared with enteric coatings. 

The polypeptides or conjugates may be admixed with adjuvants such as lactose, 
sucrose, starch powder, cellulose esters of alkanoic acids, stearic acid, talc, magnesium 

20 stearate, magnesium oxide, sodium and calcium salts of phosphoric and sulphuric acids, 
acacia, gelatin, sodium alginate, polyvinyl-pyrrolidine, and/or polyvinyl alcohol, and 
tableted or encapsulated for conventional administration. Alternatively, they may be 
dissolved in saline, water, polyethylene glycol, propylene glycol, ethanol, oils (such as 
corn oil, peanut oil, cottonseed oil or sesame oil), tragacanth gum, and/or various buffers. 

25 Other adjuvants and modes of administration are well known in the pharmaceutical art. 
The carrier or diluent may include time delay material, such as glyceryl monostearate or 
glyceryl distearate alone or with a wax, or other materials well known in the art. 

The pharmaceutical compositions may be subjected to conventional pharmaceutical 
operations such as sterilization and/or may contain conventional adjuvants such as 

30 preservatives, stabilizers, wetting agents, emulsifiers, buffers, fillers, etc., e.g. as 
disclosed elsewhere herein. 
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Liquid dosage forms for oral administration may include pharmaceutical^ 
acceptable emulsions, solutions, suspensions, syrups and elixirs containing inert diluents 
commonly used in the art, such as water. Such compositions may also comprise adjuvants 
such as wetting agents, sweeteners, flavoring agents and perfuming agents. 

5 Pulmonary delivery 

Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically 
comprise the polypeptide or conjugate dissolved in water at a concentration of, e.g., about 
0.01 to 25 mg of conjugate per mL of solution, preferably about 0.1 to 10 mg/mL. The 
formulation may also include a buffer and a simple sugar (e.g., for protein stabilization 

10 and regulation of osmotic pressure), and/or human serum albumin ranging in 

concentration from 0.1 to 10 mg/ml. Examples of buffers that may be used are sodium 
acetate, citrate and glycine. Preferably, the buffer will have a composition and molarity 
suitable to adjust the solution to a pH in the range of 3 to 9. Generally, buffer molarities 
of from 1 mM to 50 mM are suitable for this purpose. Examples of sugars which can be 

15 utilized are lactose, maltose, mannitol, sorbitol, trehalose, and xylose, usually in amounts 
ranging from 1% to 10% by weight of the formulation. 

The nebulizer formulation may also contain a surfactant to reduce or prevent surface 
induced aggregation of the protein caused by atomization of the solution in forming the 
aerosol. Various conventional surfactants can be employed, such as polyoxyethylene fatty 

20 acid esters and alcohols, and polyoxyethylene sorbitan fatty acid esters. Amounts will 
generally range between 0.001% and 4% by weight of the formulation. An especially 
preferred surfactant for purposes of this invention is polyoxyethylene sorbitan 
monooleate. 

Specific formulations and methods of generating suitable dispersions of liquid 
25 particles of the invention are described in WO 94/20069, US 5,915,378, US 5,960,792, 
US 5,957,124, US 5,934,272, US 5,915,378, US 5,855,564, US 5,826,570 and US 
5,522,385 which are hereby incorporated by reference. 

Formulations for use with a metered dose inhaler device will generally comprise a 
finely divided powder. This powder may be produced by lyophilizing and then milling a 
30 liquid conjugate formulation and may also contain a stabilizer such as human serum 
albumin (HSA). Typically, more than 0.5% (w/w) HSA is added. Additionally, one or 
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more sugars or sugar alcohols may be added to the preparation if necessary. Examples 
include lactose maltose, mannitol, sorbitol, sorbitose, trehalose, xylitol, and xylose. The 
amount added to the formulation can range from about 0.01 to 200% (w/w), preferably 
from approximately 1 to 50%, of the conjugate present. Such formulations are then 

5 lyophilized and milled to the desired particle size. 

The properly sized particles are then suspended in a propellant with the aid of a 
surfactant. The propellant may be any conventional material employed for this purpose, 
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a hydrofluorocarbon, or a 
hydrocarbon, including trichlorofluoromethane, dichlorodifluoromethane, 

1 0 dichlorotetrafluoroethanol, and 1,1,1 ,2-tetrafluoroethane, or combinations thereof. 

Suitable surfactants include sorbitan trioleate and soya lecithin. Oleic acid may also be 
useful as a surfactant. This mixture is then loaded into the delivery device. An example of 
a commercially available metered dose inhaler suitable for use in the present invention is 
the Ventolin metered dose inhaler, manufactured by Glaxo Inc., Research Triangle Park, 

15 NC,USA. 

Formulations for powder inhalers will comprise a finely divided dry powder 
containing conjugate and may also include a bulking agent, such as lactose, sorbitol, 
sucrose, or mannitol in amounts which facilitate dispersal of the powder from the device, 
e.g., 50% to 90% by weight of the formulation. The particles of the powder shall have 

20 aerodynamic properties in the lung corresponding to particles with a density of about 1 
g/cm 2 having a median diameter less than 10 micrometers, preferably between 0.5 and 5 
micrometers, most preferably of between 1.5 and 3.5 micrometers. An example of a 
powder inhaler suitable for use in accordance with the teachings herein is the Spinhaler 
powder inhaler, manufactured by Fisons Corp., Bedford, MA, USA. 

25 The powders for these devices may be generated and/or delivered by methods 

disclosed in US 5,997,848, US 5,993,783, US 5,985,248, US 5,976574, US 5,922,354, 
US 5,785,049 and US 5,654,007. 

Mechanical devices designed for pulmonary delivery of therapeutic products, 
include but are not limited to nebulizers, metered dose inhalers, and powder inhalers, all 

30 of which are familiar to those of skill in the art. Specific examples of commercially 

available devices suitable for the practice of this invention are the Ultravent nebulizer, 
manufactured by Mallinckrodt, Inc., St. Louis, MO, USA; the Acorn II nebulizer, 
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manufactured by Marquest Medical Products, Englewood, CO, USA; the Ventolin 
metered dose inhaler, manufactured by Glaxo Inc., Research Triangle Park, NC, USA; the 
Spinhaler powder inhaler, manufactured by Fisons Corp., Bedford, MA, USA the 
"standing cloud" device of Nektar Therapeutics, Inc., San Carlos, CA, USA; the AIR 
5 inhaler manufactured by Alkermes, Cambridge, MA, USA; and the AERx pulmonary 
drug delivery system manufactured by Aradigm Corporation, Hayward, CA, USA. 

KITS 

The present invention also provides kits including the polypeptides, conjugates, 
polynucleotides, expression vectors, cells, methods, compositions, and systems, and 

10 apparatuses of the invention. Kits of the invention optionally comprise at least one of the 
following of the invention: (1) an apparatus, system, system component, or apparatus 
component as described herein; (2) at least one kit component comprising a polypeptide 
or conjugate or polynucleotide of the invention; a plasmid expression vector encoding a 
polypeptide of the invention; a cell expressing a polypeptide of the invention; or a 

15 composition comprising at least one of any such component; (3) instructions for 

practicing any method described herein, including a therapeutic or prophylactic method, 
instructions for using any component identified in (2) or any composition of any such 
component; and/or instructions for operating any apparatus, system or component 
described herein; (4) a container for holding said at least one such component or 

20 composition, and (5) packaging materials. 

In a further aspect, the present invention provides for the use of any apparatus, 
component, composition, or kit described above and herein, for the practice of any 
method or assay described herein, and/or for the use of any apparatus, component, 
composition, or kit to practice any assay or method described herein. 

25 

EXAMPLES 

The following examples are offered to illustrate the present invention, but not to 
limit the spirit or scope of the present invention in any way. 
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MATERIALS AND METHODS 

I. DETERMINATION OF SURFACE-ACCESSIBLE RESIDUES 

Accessible Surface Area (ASA) 

The computer program Access (B. Lee and F.M. Richards, J. Mol. Biol. 55: 379-400 
5 (1971)) version 2 (©1983 Yale University) was used to compute the accessible surface 
area (ASA) of the individual atoms in the structure. This method typically uses a probe- 
size of 1.4 A and defines the Accessible Surface Area (ASA) as the area formed by the 
center of the probe. Prior to this calculation all water molecules and all hydrogen atoms 
should be removed from the coordinate set, as should other atoms not directly related to 
10 the protein. 

Fractional ASA of side chain 

The fractional ASA of the side chain atoms is computed by division of the sum of 
the ASA of the atoms in the side chain by a value representing the ASA of the side chain 
atoms of that residue type in an extended ALA-x-ALA tripeptide. See Hubbard, Campbell 
15 & Thornton (1991) 7. Mol Biol 220, 507-530. For this example the CA atom is regarded 
as a part of the side chain of glycine residues but not for the remaining residues. The 
following values are used as standard 100% ASA for the side chain (Table 6): 



Ala 


69.23 


A 2 


Leu 


140.76 


A 2 


Arg 


200.35 


A 2 


Lys 


162.50 


A 2 


Asn 


106.25 


A 2 


Met 


156.08 


A 2 


Asp 


102.06 


A 2 


Phe 


163.90 


A 2 


Cys 


96.69 


A 2 


Pro 


119.65 


A 2 


Gin 


140.58 


A 2 


Ser 


78.16 


A 2 


Glu 


134.61 


A 2 


Thr 


101.67 


A 2 


Gly 


32.28 


A 2 


Trp 


210.89 


A 2 


His 


147.00 


A 2 


Tyr 


176.61 


A 2 


He 


137.91 


A 2 


Val 


114.14 


A 2 



20 

Residues not detected in the structure are defined as having 100% exposure as they 
are thought to reside in flexible regions. In the case where an ensemble of NMR 
structures is analyzed, the average ASA value of the ensemble is used. 
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Determination of surface exposed residues when no three-dimensional structure is 
available: 

When no three-dimensional structure is available or if the structure is not detailed 
enough to determine surface accessibility (e.g. if only the position of the CA atoms is 
5 known) the surface accessibility may be inferred from a sequence alignment created as 
follows: 

A: If the structure is known but not detailed enough to determine surface 
accessibility: 

The low detail structure is included in a structure-based sequence alignment 
10 to the known structures of the sequence family using the MODELER 

program available from Molecular Simulations, Inc. 
B: If no structure is known: 

The sequence is aligned to a predefined sequence alignment, including the 
sequences of the known structures of the sequence family, that may be 
15 prepared using the "profile/structure alignment" option of the program 

ClustalW (Thompson et al. (1994) Nucleic Acids Research 22:4673-4680). 
From the sequence alignment obtained in A or B, residues in the sequence to be 
analyzed at positions equivalent to residues exposed in at least one of the other sequences 
having a known structure are defined as being exposed. The degree of exposure is taken 
20 to be the largest value for the equivalent residues in the other sequences. In cases where 
the sequence to be analyzed is at an insertion (i.e. there are no equivalent residues in the 
other sequences) this residue is defined as being fully exposed, as it most probably is 
located in a turn/loop region. In cases where a low detailed structure exists, those residues 
not observed in the structure are defined as being fully exposed, as they are thought to be 
25 in flexible regions. 

Determining distances between atoms: 

The distance between atoms is readily determined using molecular graphics 
software, e.g. Insightll® 98.0 from Molecular Simulations, Inc. 
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II. PROTEIN EXPRESSION AND PURIFICATION 

A. Expression and Purification from CHO Cells 

Some polypeptides of the invention were produced in Chinese Hamster Ovary 
(CHO) Kl cells (ATCC: CCL-61) that were stably transfected and selected with G418 to 
5 establish clonal cell lines. 

1. CHO Expression Construct: 

Nucleic acids encoding polypeptides of the invention were cloned into a CHO 
expression vector, under control of the SV40 promoter and in-frame with a sequence 
which encodes an N-terminal leader sequence, and, optionally, one or two a C-terminal 
10 tag sequences. The leader sequence was either a generic leader sequence, IFN alpha 6 
leader or a modified IFN alpha 6 leader sequence, and C-terminal tags included an E-tag 
(Amersham Biosciences) &/or a His-tag. Plasmid production was in XL 1 -Blue cells. 

2. Selection of stable subclones expressing IFN-alpha polypeptides: 
Materials: 

15 Culture medium: DMEM-F12 with G418, FBS and Penicillin, Streptomycin and 

Glutamine (PSG; Gibco/Invitrogen) ; 
lxPBS (Gibco/Invitrogen); 
Trypsin/EDTA 

Anti E-Tag Antibody-HRP conjugate (Amersham Biosciences) 
20 ECL Plus Western Blotting detection Reagents (Amersham Biosciences) 

Procedure: Stable transfectants were generated under selection with G418 in 
DMEM/F-12 medium with FBS and penicillin. Cells were split into T175 flasks with 50 
ml of selection medium and incubated in a 37°C CO2 incubator for -24 hr. or until cells 
reached 80% confluence. Cells were harvested by washing with PBS followed by 
25 addition of 2.5ml Trypsin/EDTA and incubation at 37°C for 3-5 min. Cells were 

collected and recovered by centrifugation at lOOOg for 30 min in a Beckman Model bench 
top centrifuge. Cells were washed once in PBS and resuspended in 3 ml PBS with 1% 
FBS. The cell density was determined and adjusted to lxlO 6 cell/ml with PBS/FBS. For 
each IFN-alpha, polypeptide cells were sorted in a DakoCytomation MoFlo sorter into 2- 
30 5 96 well plates containing 200ml of selection medium. The plates were incubated in a 
37°C incubator for 10-14 days to allow the sorted cells to grow. Two subclones were 
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selected for each IFN-alpha polypeptide for high level expression first by dot blot 
analysis and subsequently confirmed by Western blot analysis using an anti-E tag 
antibody-HRP conjugate and chemiluminescent detection. 

3. Protein Expression: 

5 Materials: 

DMEM-F12 medium (Gibco/Invitrogen) 
Ultra CHO medium (BioWhittaker) 
CHO III A medium (Gibco/Invitrogen) 

Ex-Cyte Growth Enhancing Media supplement (Serologicals Proteins) 
10 ITS A (Insulin, Transferrin, Selenium supplement for adherent culture; 

Gibco/Invitrogen) 

Penicillin/Streptomycin (P/S) 
FBS, PBS, Trypsin 
Procedure: 

15 Day 1: Cells from one T-175 flask were transferred to one roller bottle (1700 cm 2 ) 

in 300 ml DMEM-F12 with 10% FBS and lxP/S and grown in a 37°C C0 2 incubator. 
Day 3: Medium was changed to 300 ml fresh DMEM -F 1 2-FB S -P/S . 
Day 5: The medium was changed to 300 ml Ultra CHO with 1/1000 Ex-Cyte and 

P/S. 

20 Day 7: The media was replaced with 300 ml CHO III A + P/S production medium. 

Supernatants were harvested on Day 8, 9 and 10. The supernatants were centrifuged 
at 2000g for 20 min in a Beckman Coulter Allegra 6R bench top centrifuge and filtered 
using a 0.2jx PES bottle top sterile filter and stored at 4°C for purification. 

4. Protein Purification: 

25 Some polypeptides of the invention were expressed as fusion proteins containing a 

> 

13 amino acid E-tag sequence at the C-terminus. Such polypeptides were purified using 
an E-tag affinity column, as follows. 

Materials: Recombinant Phage Antibody System Purification Module (Amersham 
Biosciences, Cat. No. 17-1362-01). Purification kit contains a 16mm diameter x 25mm 
30 height (5 ml bed volume) anti E-Tag column and associated buffers. 
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Procedure: Supernatants collected from CHO-HK1 cells in roller bottles were 
clarified using a combination of centrifugation at 2800xg for 20 min and filtration using a 
0.2|a PES bottle top filter module. Supernatants were loaded onto the E-tag column 
equilibrated in RPAS binding buffer at 150 cm/h (5 ml/min). The column was washed 
5 with 5 CV (column volume) of binding buffer and the protein was eluted at 75 cm/h 
(2.5ml/min) with RPAS elution buffer. Elution fractions were neutralized with 0.05 
volumes of 1M Tris-Cl pH 8.0, dialyzed into PBS, concentrated to 0.1-1.5 mg/ml and 
stored frozen in aliquots at -80°C. Samples for assays were formulated at 50 |LLg/ml in 
PBS with 0.5% BSA and stored frozen in aliquots at -80°C. Samples were routinely 
10 analyzed by SDS-PAGE followed by Coomassie staining using materials, reagents and 
protocols obtained from Invitrogen. 

Protein concentrations were routinely determined by the BCA assay using an IFN- 
alpha standard, the concentration of which had been verified by amino acid analysis. 

B. Expression and Purification from E. coli 

15 Some polypeptides of the invention were produced in E. coli as inclusion bodies, 

which were purified and refolded as follows. 

la. E. coli Expression Construct 

In some instances, nucleic acids encoding polypeptides of the invention were 
modified for improved expression in E* coli. Such modifications comprised replacing 

20 rare Arg-Arg codon pairs AGGAGG at nucleotide positions 34-39 and AGGAGA at 
nucleotide positions 64-69 (position numbering relative to SEQ ID NO:59) each with 
Arg-Arg codon pairs such as CGTCGC which are preferred in E. coli, or replacing most 
or all rare Arg codons AGA and AGG in the coding sequence with preferred E. coli Arg 
codons CGC or CGT. A methionine codon (ATG) was also added to the 5' end of the 

25 coding sequence. Exemplary coding sequences modified in such a manner include SEQ 
ID NOs: 60-61, 63-64, 66-67, 69-70, 72-73, 75-76, 78-79, 81-82, 84-85, and 87-88. The 
coding sequence was placed into the pET-42 expression vector (Novagen) under control 
of a T7 promoter with kanamycin selection marker, or into the pQE80-Kan expression 
vector (Qiagen) under the control of a T5 promoter with kanamycin selection marker. 
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lb. Protein Expression 

pET-42 vectors containing interferon coding sequences were transformed into an E. 
coli strain such as BL21(DE3) using standard methods, and plated on to agar plates 
containing 50 Jig/ml kanamycin and incubated at 37°C. After 18-24h, three separate 

5 colonies were picked and transferred into tubes containing 5 ml of 2xYT with 50 |Llg/ml 
kanamycin and incubated overnight at 37°C. The overnight culture was used to inoculate 
2 sets of flasks containing 100ml of 2xYT with 50|lg/ml kanamycin. The growth of the 
culture at 37°C was monitored at OD6oo- The culture was induced at an ODeoo of 0.5-0.8 
with 1 mM IPTG for 3h at 37°C. IPTG induced cultures were analyzed for expression by 

10 SDS-PAGE by lysing pelleted cells in SDS sample buffer. The corresponding uninduced 
sets of cultures were used to prepare frozen stocks by addition of 25% glycerol and 
freezing cells in 1 ml aliquots at -80°C. pQE80-Kan vectors containing interferon 
coding sequences were transformed into E. coli strains W31 10 or W31 10-fhuA. 
Expression was verified as described above. 

15 Larger scale shake flask expression was performed by inoculating 4xlL 2xYT media 

+kanamycin with 25ml of an overnight culture. Cultures were monitored at OD600 and 
induced with ImM IPTG at 0.5-0.8 OD units. After 3h of induction cells are harvested 
by centrifugation at 5000g and stored frozen at -80°C. Cells were disrupted using 2-3 
passes through a French press or a APV 1000 homogenizer at 10,000 psi and processed as 

20 described under "Isolation of IB" and subsequent sections. 

Fed-batch fermentation was conducted at 10L scale in a B.Braun bioreactor in 
Terrific Broth (TB) medium supplemented with trace element solution and 40mg/L 
kanamycin. Fermentation was initiated by inoculating the bioreactor with a 400 ml 
overnight culture in TB medium. During the initial growth phase the dissolved oxygen 

25 (DO) was maintained at 50% by varying the agitation rate. When the OD600 of the 
culture reached 5.0, the glycerol/amino acid feed was initiated at 0.5 ml/min and the 
agitation was set to 1000 rpm. The feed rate was adjusted to maintain the DO at 40% for 
the rest of the fermentation process. When the OD600 reached 25 the culture was 
induced by addition of IPTG to a final concentration of 1 mM. Three hours post 

30 induction the cells were harvested by centrifugation at lOOOOxg in a Beckman centrifuge 
and the cell paste was stored at -60°C The OD600 at harvest was typically around 35-40. 
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2a. £. coli Expression Construct (for high-level expression) 

The following example shows the expression of two exemplary polypeptides of the 
invention cloned into an E. coli expression system (as described in EP 0972838 ) 

5 allowing high level expression of eukaryotic genes in E. coli. Coding sequences 

corresponding to the amino acid sequences of 14epil8 (SEQ ID NO: 13) and 25epil9 
(SEQ ID NO:38), both containing an additional Met at the N-terminus, were optimized 
according to codon-usage and mRNA secondary structure (Griswold et ah, Protein Expr 
Purif. 2003 Jan;27(l): 134-42) using the sequence analysis program GCG (Genetic 

10 Computer Group, Inc., Wisconsin USA, Version 7.1, March 1992) function 
"backtranslate" / "EcoHigh-database" to prevent rare codons, and function 
MFold/Plotfold (-menu=f or -menu=g) for prediction and prevention of potential 
secondary structures detrimental to the gene expression. Coding sequences optimized in 
this manner are provided as SEQ ID:89 and SEQ ID NO:90, respectively. 

15 The optimized DNA sequences were cloned into OripBR-URA3 with an additional 

lad gene. The resulting vectors, designated OripBR-URA3-lacI_14epil8-mut2 (Fig. 6) 
and OripBR-URA3-lacI_25epil9-mut (Fig. 7), were transformed into competent E. coli 
UT5600 (ApyrF) cells (as described in EP 0972838). Transformed cells were selected on 
minimal M9-medium agar plates without addition of Uracil. 



20 M9-Agar plates: 





Sterile agarose stock solution (final cone. 15g/l) 


430 ml 




M9-salt solution 


50 ml 




1M MgS0 4 


0.5 ml 




1M CaCl 2 


0.05 ml 


25 


20% Glucose 


5 ml 




Vitamin Bl (c=l mg/ml) 


2.5 ml 




10% Casaminoacids 


25 ml 




L-Tryptophan (c= 5 mg/ml) 


5 ml 




L-Leucine (c=5 mg/ml) 


5 ml 


30 


L-Proline (c=5 mg/ml) 


2.5 ml 



2b. Protein Expression 
Transformed cells according to 2a. above were inoculated in shake flasks and 
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cultured at 37°C. Induction was performed at OD 578 =l by addition of 1 mM IPTG. Cells 
were harvested after 7 h and cell pellets were analyzed on an SDS-PAGE gel. The media 
used is as follows: 

Casaminoacids 120 g 

NH 4 C1 3.9 g 

Tryptophan 5.5 g 

Leucine 5.5 g 

Proline 5.5 g 

Glycerin 85 g 
dissolved in 4300 ml H 2 0 

and 

K 2 HP0 4 *3H 2 0 91.5 g 

Citric acid 8.0 g 

dissolved in 600 ml H20 
and 

Thiamin (5 mg/ml) 17.5 ml 

MgS0 4 x7H 2 0 (1M) 17.5 ml 
pH = 7.0 

adjust to 5000 ml with H 2 0 



3. Isolation, Solubilization, Sulfonation and Refolding of Inclusion Bodies (IB) 

For isolating IB the thawed cell paste was resuspended in 1 x PBS at 10 ml per gram 
of cell paste and mixed until a uniform slurry was obtained. The cells were disrupted by 
two passes through a microfluidizer at 17,000-19,000 psi. The cell lysate was adjusted to 
1% Triton-XlOO, mixed for 10 min and the IB were recovered by centrifugation at 
10,000g for 60 min at 2-8°C. The IB pellet was washed once by resuspending in PBS 
with 1% Triton-XlOO and recovered by centrifugation as above and stored frozen at - 
80°C. 

3a. Solubilization /Refolding Method 1: 

For solubilizing IB, the IB pellet was resuspended in Urea buffer (50 mM Tris-Cl, 
200 mM NaCl, 8 M urea, 2 mM DTT, 1 mM EDTA, pH 8.0) at 10 ml buffer per gram of 
cell paste. The suspension was mixed for 30 min and centrifuged at 10,000g for 30 min 
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at 2-8°C. The pellet was washed once by resuspending in the Urea buffer without DTT, 
centrifuged as above and washed twice with water. The washed pellet was solubilized in 
Guanidine buffer (50 mM Tris-Cl, 200 mM NaCl, 8 M guanidine-HCl, 1 mM EDTA, pH 
8.0) at 10ml buffer per gram pellet, mixed for 30-60 min and centrifuged at 10,000g for 
5 30 min at 2-8°C. The supernatant containing the solubilized IFN was adjusted to 10 

mg/ml sodium sulfite and 5 mg/ml sodium tetrathionate to initiate the sulfonation process 
which was performed at 2-8°C for 16 hours. Post-sulfonation the IFN solution was 
diluted 2-fold with water and the IFN pellet was recovered by centrifugation at 10,000g at 
2-8°C. The pellet was washed twice with water and resuspended in Guanidine buffer as 
10 above. The protein concentration of the sulfonated IFN was determined by absorbance at 
280nm. 

The refolding process was initiated by diluting the sulfonated IFN in Guanidine 
buffer at 2-8°C to a final concentration of 100 mg/ml in Refolding buffer (50 mM Tris-Cl, 
20 mM NaCl, 2 mM reduced glutathione, 1 mM oxidized glutathione). Refolding was 
15 performed at 2-8°C with slow mixing for 6-8 h, followed by addition of CuS0 4 to a final 
concentration of 2 jjM followed by an additional refolding period of 16-20 h. The 
progress of the refolding reaction was monitored by SDS-PAGE and reverse phase 
HPLC. 

3b. Solubilization /Refolding Method 2: 

20 For solubilization of 14epil8, the IB pellet was resuspended in Guanidine buffer (7 

M Guanidine, 20 mM Na 2 HP0 4 , 10 mM DTE, pH 8.0) at 4 ml buffer per gram of IB 
paste. The suspension was mixed for 2 hours and, after solubilization, diafiltered at least 8 
volumes at room temperature against a buffer containing 7 M Guanidine buffer and 20 
mM NaH 2 P0 4 pH 5.0 using 10 kDa cut-off membranes. 

25 For refolding, the solubilized protein was diluted into a folding buffer containing 20 

mM Na 2 HP0 4 , 0.8 M arginine and 2 [iM CuS0 4 , pH 7.5. Refolding was performed at 2- 
8°C with slow mixing for 72 hours at a final protein concentration of about 0.5 mg/ml 
protein. The progress of the refolding reaction was monitored by reverse phase HPLC. 
3b. Solubilization /Refolding Method 3: 

30 For solubilization of 25epil9, the IB slurry was resuspended in solubilisation buffer 

(8 M Guanidinium chloride, 100 mM DTT, 50 mM Tris pH 8.3). The suspension was 
mixed for 30 minutes at room temperature. Subsequently the pH was adjusted to 4.5 with 
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HO. The solubilisate was further concentrated using a 5 K membrane and diafiltrated 
against a 10 fold volume of 7.2 M Guanidinium chloride adjusted to pH 4.5 with HC1. 

For refolding, the solubilized protein was diluted into a folding buffer (0.8 M 
Arginine, 50 mM potassium phosphate, 2 mM GSH, 1 mM GSSG pH 7.8). Refolding was 
5 performed at 2 - 8°C overnight at a final concentration of about 1 mg/ml protein. 

4. Purification 

4a. Purification Method 1 : 

The refolded IFN was purified using three chromatography steps. The refolding 
solution was adjusted to 80% saturating concentration of ammonium sulfate 

10 (weight/volume), filtered through a 0.2 \±M filter and loaded at 200 cm/h onto a 20 ml 
Butyl Sepharose Fast Flow Hydrophobic Interaction Chromatography (HIC) column 
(Amersham Biosciences) equilibrated in Equil buffer (50mM Tris-Cl, 0.8 M ammonium 
sulfate, pH 8.0). The HIC column was washed with 8 column volumes (CV) of Equil 
buffer, followed by 8 CV of Wash buffer (50 mM Tris-Cl, 0.5 M ammonium sulfate, pH 

15 8.0). IFN was eluted with 50 mM Tris-Cl, pH 8.0 with or without 15% saturating 
concentration of ammonium sulfate. 

The HIC pool was adjusted to 50 mM sodium acetate using 0.5 M sodium acetate, 
pH 4.5 stock and diluted two fold. This adjusted pool was loaded at 90 cm/h on to a 5 ml 
HiTrap CM Sepharose Fast Flow column (Amersham Biosciences), equilibrated in 50 

20 mM sodium acetate, 100 mM NaCl, pH 5.0. Post loading the column was washed with 5 
CV under equilibration buffer conditions and IFN was eluted using a 20 CV gradient 
from 100-650 mM NaCL Peak fractions containing IFN were pooled based on 
absorbance at 280 nm. 

The CM Sepharose pool was adjusted to 20 mM 1,3 diaminopropane using a 2 M 1,3 

25 diaminopropane stock to set pH at -10. The sample was loaded at 200 cm/h on to a 5 ml 
HiTrap Q Sepharose Fast Flow column (Amersham Biosciences), equilibrated in 50 mM 
1,3 diaminopropane, 100 mM NaCl, pH 10.0. Post loading the column was washed with 
5 CV of buffer under equilibration conditions and IFN was eluted using a 20 CV gradient 
from 100-650 mM NaCl or from 100-1000 mM NaCl. The fractions containing IFN are 

30 pooled based on absorbance at 280 nm and immediately dialyzed against 250-500 

volumes of 50 mM sodium acetate, 150 mM NaCl, pH 5.0 or 50mM sodium borate, pH 
9.0. Post-dialysis the IFN samples were sterile filtered using a 0.2 jllM filter in a biosafety 
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cabinet, the concentration was measured by absorbance at 280 nm and the samples are 
stored at 2-8°C for periods up to a week or in aliquots at -80°C for extended storage. For 
activity assays the sample was generally formulated in 0.5% BSA, PBS pH 7.4 at 5, 20 
and 50ug/ml, and stored frozen in aliquots at -80°C. 
5 4b. Purification Method 2: 

Refolded 14epil8 was purified using two chromatographic steps. The refolding 
solution was adjusted to pH 8.0 and around 190 mS/cm with KC1, filtered through a 5 \im 
filter and loaded at 75 cm/h onto a Butyl-Sepharose Fast Flow Hydrophobic Interaction 
Chromatography (HIC) column (General Electric Healthcare) equilibrated in equilibration 

10 buffer consisting of 2.0 M KC1, 20 mM K 2 HP0 4 , 0.8 M arginine, pH 8.0. The HIC 

column was washed with 5 column volumes (CV) of 2.0 M KC1, 20 mM K 2 HP0 4 , 0.8 M 
arginine pH 8.0. 14epil8 was eluted with 0.4 M KC1, 20 mM K 2 HP0 4 , pH 8.0. The 
elution was monitored by reverse phase HPLC. 

The eluate was diluted with distilled water to a conductivity of around 3 mS/cm. The 

15 pH was adjusted to 6.8. This adjusted pool was loaded at 50 cm/h onto a Q-Sepharose 
Fast Flow column (GE Healthcare) equilibrated in 25 mM ammonium acetate pH 6.8. 
Post loading, the column was washed with equilibration buffer. 14epil8 was eluted with a 
linear gradient to pH 4.0. Peak fractions containing the folded 14epil8 were pooled based 
on reversed phase HPLC analytics. 

20 4c. Purification Method 3: 

Refolded 25epil9 was purified using two chromatographic steps. The refolding 
solution was adjusted to 1.3 M NaCl and the solution was filtered over a combination of 
two filters (0.8/0.3jxm followed by 0.45/0.2pim). The protein was loaded onto a Butyl 
Sepharose Fast Flow column (General Electric Healthcare) equilibrated with equilibration 

25 buffer (0.8 M Arginine, 1.3 M sodium chloride, 50 mM potassium phosphate pH 8.0) at 
50- 150 cm/h. After sample application, the column was washed with 3 CV equilibration 
buffer and 3 CV wash buffer (20 mM potassium phosphate, 2 M sodium chloride pH 7.5). 
Step elution was performed by applying elution buffer (20 mM potassium phosphate, pH 
7.5) with a flow of 50 - 100 cm/h. The peak was pooled according to UV280 

30 measurement. 

After assuring that the conductivity was below lOmS/cm, the pool was loaded at 50 
- 75 cm/h onto a Q-Sepharose Fast Flow column (GE Healthcare) equilibrated in 30 mM 
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ammonium acetate, pH 5.9. Post loading, the column was washed with equilibration 
buffer until baseline (UV280) was reached. 25epil9 was eluted with a linear gradient to 
pH 3.5. Peak fractions containing the folded 25epil9 were pooled according to reversed 
phase HPLC analytics and SDS-PAGE. 

5 

III. ACTIVITY ASSAYS 

Al. HuH7-EMCV Antiviral Assay 

Provided below is an exemplary assay for antiviral activity of interferon-alpha 
polypeptides and conjugates of the invention. The assay is a cell-based dose-response 

10 assay used to assess the anti-viral potency of a drug, and is sometimes referred to as 

"protection from cytopathic effect" (or PCPE) assay, also termed inhibition of cytopathic 
effect (CPE) assay. Briefly, cells are incubated with drug and exposed to virus. In the 
absense of drug, cells exposed to virus die. With increasing concentrations of drug, an 
increasing proportion of cells survive. The number of surviving cells can be measured 

15 directly (e.g., by visual counts) or indirectly by estimating metabolic rate. For example, 
metabolic dyes such as MTT or WST-1 may be used as an indirect measure of cell 
survival. Live cells metabolize such dyes to form metabolic products which can be 
quantified by spectrophotometry (optical density). 
Materials and Reagents: 

20 HuH7 Cells: Human hepatoma cell line (obtained from Dr. Michael Lai, USC- 

Surgery Department, LosAngeles, CA). The cell line may also be obtained from the Cell 
Bank of the Japanese Collection of Research Bioresources (JCRB) / Health Science 
Research Resources Bank (HSRRB), Osaka, Japan. The HuH7 cell line was originally 
established in the laboratory of Dr. J. Sato (Okayama University School of Medicine) 

25 from a 57-year old Japanese male with well-differentiated hepatocellular carcinoma 

(Nakabayashi, H., et al (1982) Cancer Res. 42(9):3858-63). The cell line is negative for 
Hepatitis B surface antigen. 

VERO Cells: African green monkey kidney cell line (ATCC # CCL-81). The cell 
line was derived in 1962 at Chiba University in Chiba, Japan from the kidney of a normal 

30 adult African green monkey. Passage # 93 of the VERO cell line was transferred to the 
National Institute of Health (NIH) in 1964. 
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Encephalomyocarditis virus (EMCV): tissue culture adapted strain (ATCC # VR- 
129B). A tissue culture adapted strain of encephalomyocarditis virus (EMCV) was 
purchased from the ATCC (# VR-129B). For the HuH7 antiviral assays, a high titer viral 
stock was produced in Vero African green monkey kidney cells. The viral titer of the 
5 Vero produced EMCV stock was determined by plaque assay on L929 cells to be 1.1 x 
10 8 PFU/ml. 

Complete DMEM: 

Dulbecco's Modified Eagle Medium (DMEM, Gibco Cat. No. 1 1965-092) 
10% Fetal Bovine Serum (FBS, Hyclone Cat. No. SH30071.03) 
10 1 x Penicillin-streptomycin (PS, Gibco Cat. No. 15140-122) 

Reduced Serum DMEM: 

Dulbecco's Modified Eagle Medium (DMEM, Gibco Cat. No. 1 1965-092) 
2% Fetal Bovine Serum (FBS, Hyclone Cat. No. SH3007L03) 
1 x Penicillin-streptomycin (PS, Gibco Cat. No. 15140-122) 
15 Trypsin/EDTA (Gibco Cat. No. 25300-054) 

WST-1 (Roche; Cat. No. 1 644 807) 

HuH7 cells were maintained in Complete DMEM at 37°C in a humidified 5% C0 2 
incubator. The cells were harvested with trypsin and split twice weekly when confluent 
to a final density of 1-2 x 10 6 cells per 25 ml in a Tl 75 flask. One day prior to the assay, 
20 the cells were trypsinized and seeded into new T175 flasks at a density of 4.5 x 10 6 cells 
per 25 ml to ensure that the cells were in log phase prior to the assay. 

HuH7-EMCV antiviral assay procedure: 

A high titer EMCV virus stock was amplified in VERO cells. The lethal 

25 concentration at which 95% of the cells were killed (LC95) was determined by an EMCV 
viral killing curve on HuH7 cell monolayers. Briefly, HuH7 cells were plated on day 
one in 96-well microtiter plates at 6xl0 4 cells per well. Virus was serial diluted 1:3 in 
DMEM+ 2% FBS with 10 dilution points and added to the cells on day two. Twenty-four 
hours post-infection, cell survival was determined by a tetrazolium salt metabolism assay, 

30 WST-1 (Roche). The LC95 determined for the HuH7-EMCV assay corresponded to an 
MOI of 0.034 (PFU/cell). Titer of the virus stock was determined by a standard plaque 
assay on L929 cells. 
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On day one of the assay, log phase HuH7 cells were harvested with trypsin, 
resuspended in Reduced Serum DMEM and concentrated by centrifugation. The cell 
pellets, corresponding to 5 T175 flasks of cells, were resuspended in 10 ml of Reduced 
Serum DMEM, filtered through a 40 micron Nylon cell strainer and counted with a 
hemocytometer. Cell viability was determined by trypan blue exclusion. The cells were 
resuspended in Reduced Serum DMEM to a final density of 6xl0 5 cells/mL One hundred 
microliters of the diluted cells were added to each well of a 96- well assay plates (6xl0 4 
cells/well) and the plates were incubated at 37°C in a humidified 5% CO2 incubator for 4 
hours. 

The potencies of "reference" interferon alphas and interferon-alpha polypeptides of 
the invention (also called "test samples") were determined by dose-response analysis. 
There was generally one test sample and one reference IFN-alpha per plate, each with 
three replicate curves of IFN-alpha treated/EMCV challenged cells and two replicate 
curves of IFN-alpha treatment alone. The later was assayed to control for potential 
antiproliferative effects of IFN-alpha on HuH7 cells. The dose-response curves for the 
reference IFN-alphas generally consisted of 8 three-fold dilutions ranging from 100 ng/ml 
to 0.05 ng/ml. For the IFN-alpha test samples, the three-fold dilutions generally ranged 
from 5 ng/ml to 0.002 ng/ml. Eight wells each of cells treated with virus but no IFN- 
alpha and cells alone were also run as controls. 

The IFN-alpha dilutions were prepared using Reduced Serum DMEM. One hundred 
microliters of the diluted IFN-alpha preparations were transferred to the assay plates. The 
assay plates were incubated at 37°C in a humidified 5% CO2 incubator for 16 hours. 

On day two, the cells were challenged with EMC virus. Medium was aspirated from 
each well of the assay plate. The EMCV stock was diluted 1:5400 in DMEM + 2%FBS. 
One hundred microliters of the diluted virus, corresponding to 0.034 viral particles per 
cell initially seeded, was added to each well. The cells were incubated with virus at 37°C 
in a humidified 5% C0 2 incubator for 24 hours. 

On day three, the number of viable cells in each well was quantified by WST-1 
assay. Medium was aspirated from each well of the assay plate. The WST- 1 reagent was 
diluted 1:20 in Reduced Serum DMEM, 100 (xl of the diluted WST-1 reagent was added 
to each well and the cells were incubated at 37°C in a humidified 5% C0 2 incubator for 
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60 minutes. The number of viable cells in each well was quantified by measuring OD at 
450 nm on a plate reader. 

Analysis: 

5 The antiviral potency of the IFN-alpha reference and test samples were calculated 

with the equation: 

Antiviral potency =(Viable cellsc+i+v - Viable cellsc+v)/(Viable cellsoi-Viable 
cells c+ v)*100% 

where C+V=HuH7 cells+EMCV, C+I=HuH7 cells+IFN-a, and C+I+V=HuH7 
10 cells+IFN-a+EMCV 

Dose-response curves were analyzed by non-linear regression using GraphPad Prism 
4 (GraphPad Software Inc.) The following equation was used for the curve fits: 



w ^ (Top- Bottom) 

Y = Bottom+ K H ' 



l 1 0 (LogEC50-X) HillSlope 



15 Bottom is the Y value at the bottom plateau; Top is the Y value at the top plateau, 

and LogECSO is the X value when the response is halfway between Bottom and Top. The 
Levenberg-Marquardt method was used as optimization algorithm. 



A2. HeLa-EMCV and WISH-EMCV Antiviral Assays 
20 The inhibition of the cytopathic effect of EMCV on HeLa (a human cervical 

carcinoma derived cell line) and WISH (a human amniotic tissue derived cell line) cells 
also provide useful surrogate assays to assess the antiviral potencies of interferon alpha 
polypeptides and conjugates of the invention. These assays were performed in a similar 
fashion to the HuH7-EMCV assay described above, with the following modifications. 

25 

Materials and Reagents: 

HeLa Cells: Human cervical carcinoma cell line (ATCC # CCL-2). The HeLa cell 
line was derived from a cervical adenocarcinoma from a 35-year-old female. The HeLa 
cells are reported to contain human papilloma virus (HPV-18) sequences. The cell line 
30 was received from the ATCC at passage number 107. 
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WISH Cells: Human epithelial cell line (ATCC # CCL-25). The WISH cell line was 
established from altered colonies appearing in a subculture of primary human amniotic 
cells after 35 days in culture. The WISH cell line contains HeLa marker chromosomes. 
The cells were received from the ATCC at passage # 170. 
5 L-929 Cells: Murine fibroblast cell line (ATCC # CCL-1). The L-929 cell line is a 

single cell clone isolated from the 95 th passage of the parental strain, Strain L, in 1948. 
Strain L was isolated from the subcutaneous areolar and adipose tissue of a normal 100 
day old male C[3]H/An mouse. 

EMCV: tissue culture adapted strain (ATCC # VR-129B). A tissue culture 
10 adapted strain of encephalomyocarditis virus (EMCV) was purchased from the ATCC (# 
VR-129B). For the HeLa and WISH antiviral assays, the high titer EMCV stock was 
produced in L929 cells. The viral titer of the L929 produced viral stock was 4.0 X 10 8 
PFU/ml based on the results of a plaque assay on L929 cells. 
Complete MEM: 

15 Minimal Essential Media (MEM, Gibco Cat. No. 10370-021) 

10% Fetal Bovine Serum (FBS, Hyclone Cat. No. SH30071.03) 
lx Penicillin-streptomycin-glutamine (PSG, Gibco Cat. No. 10378-016) 
ImM Sodium pyruvate (Gibco Cat. No. #1 1360-070) 
Reduced Serum MEM: 
20 Minimal Essential Media (MEM, Gibco Cat. No. 10370-021) 

2% Fetal Bovine Serum (FBS, Hyclone Cat. No. SH30071.03) 
lx Penicillin-streptomycin-gluatimine (PSG, Gibco Cat. No. 10378-016) 
ImM Sodium pyruvate (Gibco Cat. No. #1 1360-070) 
HeLa cells were maintained in Complete MEM at 37°C in a humidified 5% CO2 
25 incubator. The cells were harvested with trypsin and split twice weekly when confluent 
to a final density of 2-4 x 10 6 cells per 25 ml in a Tl 75 flask. One day prior to the assay, 
the cells were trypsinized and seeded into new T175 flasks at a density of 6 x 10 6 cells per 
25 ml to ensure that the cells were in log phase prior to the assay. 

WISH cells were maintained in Complete MEM at 37°C in a humidified 5% C0 2 
30 incubator. The cells were harvested with trypsin and split twice weekly when confluent 
to a final density of 5-6 x 10 6 cells per 25 ml in a T 175 flask. One day prior to the assay, 
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the cells were trypsinized and seeded into new T175 flasks at a density of 10 x 10 6 cells 
per 25 ml to ensure that the cells were in log phase prior to the assay. 

HeLa-EMCV antiviral assay procedure: 
5 On day one of the assay, log phase HeLa cells were harvested with trypsin, 

resuspended in Reduced Serum MEM and concentrated by centrifugation. The cell 
pellets, corresponding to 5 T175 flasks of cells, were resuspended in 10 ml of Reduced 
Serum MEM, filtered through a 40 micron Nylon cell strainer and counted with a Coulter 
Counter. Cell viability was determined by trypan blue exclusion with a hemocytometer. 

10 The cells were resuspended in Reduced Serum MEM to a final density of lxlO 5 cells/ml. 
One hundred microliters of the diluted cells were added to each well of a 96-well assay 
plates (lxlO 4 cells/well) and the plates were incubated at 37°C in a humidified 5% CO2 
incubator for 24 hours. 

The potencies of "reference" interferon alphas and interferon-alpha polypeptides of 

15 the invention (also called "test samples") were determined by dose-response analysis. 
There was generally one test sample and one reference IFN-alpha per plate, each with 
three replicate curves of IFN-alpha treated/EMCV challenged cells and two replicate 
curves of IFN-alpha treatment alone. The later was assayed to control for potential 
antiproliferative effects of IFN-alpha on HeLa cells. The dose-response curves for the 

20 reference IFN-alphas generally consisted of 8 five-fold dilutions ranging from 60 ng/ml to 
0.0008 ng/ml. For the IFN-alpha test samples, the five-fold dilutions generally ranged 
from 5 ng/ml to 0.00006 ng/ml. Eight wells each of cells treated with virus but no IFN- 
alpha and cells alone were also run as controls. 

The IFN-alpha dilutions were prepared using Reduced Serum MEM. One hundred 

25 microliters of the diluted IFN-alpha preparations were transferred to the assay plates. The 
assay plates were incubated at 37°C in a humidified 5% CO2 incubator for 16 hours. 

On day three, the cells were challenged with EMC virus. Medium was aspirated 
from each well of the assay plate. The EMCV stock was diluted 1:133 in MEM + 
2%FBS. One hundred microliters of the diluted virus, corresponding to 3 X 10 5 viral 

30 particles was added to each well. The cells were incubated with virus at 37°C in a 
humidified 5% CO2 incubator for 24 hours. 
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On day four, the number of viable cells in each well was quantified by WST-1 assay. 
Medium was aspirated from each well of the assay plate. The WST-1 reagent was diluted 
1:20 in Reduced Serum MEM, 100 |ll of the diluted WST-1 reagent was added to each 
well and the cells were incubated at 37°C in a humidified 5% CO2 incubator for 60 
5 minutes. The number of viable cells in each well was quantified by measuring OD at 450 
nm on a plate reader. 

WISH-EMCV antiviral assay procedure: 

On day one of the assay, log phase WISH cells were harvested with trypsin, 

10 resuspended in Reduced Serum MEM and concentrated by centrifugation. The cell 

pellets, corresponding to five T175 flasks of cells, were resuspended in 10 ml of Reduced 
Serum MEM, filtered through a 40 micron Nylon cell strainer and counted with a Coulter 
Counter. Cell viability was determined by trypan blue exclusion with a hemocytometer. 
The cells were resuspended in Reduced Serum MEM to a final density of 3xl0 5 cells/ml. 

15 One hundred microliters of the diluted cells were added to each well of a 96-well assay 
plates (3xl0 4 cells/well) and the plates were incubated at 37°C in a humidified 5% C0 2 
incubator for 4 hours. 

The potencies of "reference" interferon alphas and interferon-alpha polypeptides of 
the invention (also called "test samples") were determined by dose-response analysis. 

20 There was generally one test sample and one reference IFN-alpha per plate, each with 
three replicate curves of IFN-alpha treated/EMCV challenged cells and two replicate 
curves of IFN-alpha treatment alone. The later was assayed to control for potential 
antiproliferative effects of IFN-alpha on HeLa cells. The dose-response curves for the 
reference IFN-alphas generally consisted of 8 ten-fold dilutions ranging from 100 ng/ml 

25 to 0.0001 ng/ml. For the IFN-alpha test samples, the five-fold dilutions generally ranged 
from 5 ng/ml to 0.00006 ng/ml. Eight wells each of cells treated with virus but no IFN- 
alpha and cells alone were also run as controls. 

The IFN-alpha dilutions were prepared using Reduced Serum MEM. One hundred 
microliters of the diluted IFN-alpha preparations were transferred to the assay plates. The 

30 assay plates were incubated at 37°C in a humidified 5% CO2 incubator for 16 hours. 

On day two, the cells were challenged with EMC virus. Medium was aspirated from 
each well of the assay plate. The EMCV stock was diluted 1:267 in MEM + 2%FBS. One 
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hundred microliters of the diluted virus, corresponding to an MOI of approximately 5 was 
added to each well. The cells were incubated with virus at 37°C in a humidified 5% CO2 
incubator for 24 hours. 

On day three, the number of viable cells in each well was quantified by WST- 1 
5 assay. Medium was aspirated from each well of the assay plate. The WST-1 reagent was 
diluted 1:20 in Reduced Serum MEM, 100 |Lil of the diluted WST-1 reagent was added to 
each well and the cells were incubated at 37°C in a humidified 5% C0 2 incubator for 60 
minutes. The number of viable cells in each well was quantified by measuring OD at 450 
nm on a plate reader. 

10 

B. T H 1 Differentiation Assay 

Provided below is an exemplary assay for T H 1 differentiation activity of interferon- 
alpha polypeptides and conjugates of the invention. 

Assay Procedure: 

15 Human buffy coats (25-30 ml) containing leukocytes and erythrocytes prepared from 

500 ml blood were collected from Stanford Blood Bank the day of assay initiation (one 
day before the assay day) and kept at room temperature overnight. Each buffy coat was 
carefully transferred to a T75 flask and diluted to 100 ml with PBS. For each buffy coat, 
13 ml of Histopaque/Ficoll (Sigma H8889) was pipetted into four 50 ml centrifuge tubes, 

20 and 25 ml of diluted blood sample was carefully overlaid on top of the Histopaque/Ficoll 
without disrupting the interface. The tubes were then centrifuged (20°C, 2500 rpm) for 20 
minutes. Using 3 ml plastic transfer pipettes the mononuclear cell layer containing 
PBMCs was transferred to two 50ml conical tubes (cells from 2 Histopaque/Ficoll/buffy 
coat tubes to one tube). The PBMCs were then diluted to 50 ml/tube with PBS and 

25 centrifuged (20°C, 1000 rpm) for 10 minutes to remove platelets. After removal of PBS, 
the PBMCs and remaining RBCs were mixed to prevent aggregation. Ten milliliters of 
RBC lysis buffer (ammonium chloride buffer) were added and two tubes of cells were 
combined to one tube. Each tube, now containing the total PBMC and RBC isolate from 
one donor, was incubated at room temperature for 10 min. Potential clots of blood cells 

30 were removed by filtering the cells with a cell strainer (70um, Falcon Cat. No. 2350). 
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PBS was added to a total volume of 50 ml followed by centrifugation (20°C, 1000 rpm) 
for lOmin. The cell number was finally counted using a hemocytometer. 

Next, a fraction of each PBMC preparation was prestained and analyzed by FACS to 
select PBMC preparations with a percentage of naive T H 0 cells above 15%. Three 
5 hundred microliters of PBMCs were stained with 20 \xl FITC-conjugated anti-human 
CD45RA (Pharmigen, Cat. No. 555488), 20 |ul Cy-chrome conjugated anti-human CD4 
(Pharmigen, Cat. No. 555348), 5 (ill PE-conjugated anti-human CD8 (Pharmigen, Cat. No. 
555367), 5 |jl PE-conjugated anti-human CD 14 (Pharmigen, Cat. No.555398), and 5 ]ui 
PE-conjugated anti-human CD20 (Pharmigen, Cat. No. 555623), and incubated on ice for 

10 45 minutes. The cells were washed with PBS, resuspended in 1 ml PBS, and filtered with 
a 40 |Lim cell strainer (Falcon, Cat. No. 2340). The percentage of naive T H 0 cells (positive 
to CD4 and CD45RA and negative to CD8, CD 14, CD20) were quantified by FACS, and 
PBMC preparations with more than 15% naive T H 0 cells were selected for the assay. 
The selected PBMC preparations were stained with 500 |il FITC-conjugated anti- 

15 human CD45RA, 500 |LLl Cy-chrome conjugated anti-human CD4, 200 jlxI PE-conjugated 
anti-human CD8, 50 |al PE-conjugated anti-human CD 14, and 50 (il PE-conjugated anti- 
human CD20, and incubated on ice for 60 minutes. The cells were washed with PBS, PI 
was added, and the cells were diluted with 10ml of PBS followed by filtering with a 40 
[im cell strainer. The cells were FACS sorted, and lxlO 4 naive T H 0 cells (positive to CD4 

20 and CD45RA and negative to CD8, CD 14, CD20) were transferred by MOFLO into each 
well of 96 well round bottom plates, containing 160 \xl DMEM plus Penicillin- 
streptomycin plus 2 mM Glutamine and 10% Fetal Bovine Serum (Hyclone Cat. No. 
SH30071.03). 

One point seven microliters of Dynabeads CD3/CD28 T cell expander (Dynal, Cat. 

25 No. 1 1 1.32) were added to each well. The stimulatory effect of the Dynabeads was 

calibrated prior to the experiment to avoid lot-to-lot variance. Next, 20 jil/well of protein 
samples were added to the assay plates. Generally, concentration ranges for IL-4 and IL- 
12 standards (obtained from R&D Systems) were from 0.04 pg/ml to 10 ng/ml, and 
concentration ranges for IFN-alpha test samples and IFN- alpha reference sample were 

30 from 0.76 pg/ml to 200 ng/ml. 
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The cells were incubated at 37°C in a humidified 5% CO2 incubator for 6 days. 
Supernatants from each well were harvested to determine the degree of T H 1 
differentiation through quantification of the IFN-y content, using a standard ELISA (R&D 
Systems, Cat. # DIF50). 

Analysis: 

Response = IFN-y concentrations in pg/ml . 

The following equation was used for curve fitting: 

Y = Bottom. (TOP -Bottom) 



10 (LogEC50-X)-HillSlope 



The variable Bottom is the Y value at the bottom plateau; Top is the Y value at the 
top plateau, and LogEC50 is the X value when the response is halfway between Bottom 
and Top. The Levenberg-Marquardt method was used as optimization algorithm. 



C. Daudi Antiproliferation Assay 

Provided below is an exemplary assay for antiproliferative activity of interferon- 
alpha polypeptides and conjugates of the invention. 
Materials: 

Daudi Cells: Human Burkitt's lymphoma cells (ATCC Number: # CCL-213). The 
Daudi line was derived from a 16-year-old black male with Burkitt's lymphoma in 1967. 
The cell line is positive for Epstein-Barr virus. 

Complete RPMI: 

RPMI 1640 (RPMI, Gibco Cat. No. 11875-143) 

10% Fetal Bovine Serum (FBS, Hyclone Cat. No. SH30071.03) 

lx Penicillin-streptomycin-gluatimine (PSG, Gibco Cat. No. 10378-01 
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Daudi Burkitt's lymphoma cells grown in suspension were maintained in T175 tissue 
culture flasks, containing 50ml of complete RPMI at 37°C in a humidified 5% CO2 
incubator. The cells were split 1:10 when confluent. 

Assay procedure: 

Daudi cells were spun down and washed with lxPBS. The cell number was adjusted 
to 10 5 cells/ml. 80 jxl culture medium was added to each well in 96 well round bottom 
assay plates followed by transfer of 100 |al cells (10 4 cells/well) to each well. 

Eleven dilutions of the IFN-alpha reference material and IFN-alpha test samples, 
ranging from 200 ng/ml to 0.2 pg/ml (4-fold dilutions), were prepared in dilution plates 
using culture medium. Twenty jal of the diluted IFN-alpha preparations were then 
transferred to the assay plates. 

The cells were incubated at 37°C in a humidified 5% CO2 incubator. After 48 hours, 
l\xd of methyl- 3 H thymidine (Amersham Pharmacia, Cat. No. TRK758) was added to 
each well followed by incubation for 24 hours at 37°C in a humidified 5% C0 2 incubator. 
The cells were harvested on the following day and incorporation of thymidine was 
determined. 

Alternatively, the Daudi antiproliferation assay may be performed by measuring the 
number of viable, metabolically active Daudi cells based on the amount of ATP present in 
a culture well following interferon treatment. In this procedure, 8000 Daudi cells are 
plated in a volume of 50 jxl in a 96 well round bottom assay plate. The cells are allowed 
to incubate at 37°C for 2 hours in a 5% C0 2 incubator prior to the addition of the 
interferon dilutions. The interferons are diluted 6-fold (9-points) generally starting from 
5-10 ng/ml. Following addition of the interferons, the cells are incubated for 72 hours at 
37°C in a humidified 5% C0 2 incubator. The number of viable cells in each well is then 
determined by the addition of 100 jiil of CellTiter-Glo™ (Promega, Cat #G7572). The 
cells plus CellTiter-Glo™ reagent are mixed for 2 minutes and incubated at room 
temperature in the dark for 30-60 minutes. The amount of luminescent signal for each 
well is determined using an Analyst HT instrument. 
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Analysis: 

The EC50 of the IFN-alpha reference and samples were calculated using the 
equation: 

(Top - Bottom) 



Y = Bottom+ 



10 (LogEC50-X).HillSlope 



5 where Bottom is the Y value at the bottom plateau; Top is the Y value at the top 

plateau, and LogEC50 is the X value when the response is halfway between Bottom and 
Top. The Levenberg-Marquardt method was used as optimization algorithm. 

EXAMPLE 1: DETERMINATION OF SURFACE-ACCESSIBLE RESIDUES OF 
1 0 INTERFERON- ALPHAS 

Surface exposure of human interferon- a 2a residues: 

Based on the 24 NMR structures of human interferon-alpha 2a reported by Klaus et 
ah, J. Mol Biol, 274: 661-675 (1997), the fractional ASA of side chains was calculated. 
The sequence numbering used below is based on the mature sequence of the human 

15 interferon-alpha 2a protein (identified herein as SEQ ID NO:47). It is noted that this 
structure contains two disulphide bridges involving Cysl-Cys98 and Cys29-Cysl38, 
respectively. By computing the ASA and the fractional ASA and taking the average of the 
24 structures, focusing on the ASA of the side chains, it was determined that the 
following residues have more than 25% fractional ASA: D2, L3, P4, Q5, T6, H7, S8, L9, 

20 G10, R12, R13, M16, A19, Q20, R22, K23, 124, S25, L26, F27, S28, L30, K31, R33, 
H34, D35, G37, Q40, E41, E42, G44, N45, Q46, Q48, K49, A50, E51, E58, Q61, Q62, 
N65, S68, T69, K70, D71, S73, A74, D77, E78, T79, L80, D82, K83, T86, Y89, Q90, 
N93, D94, E96, A97, V99, 1100, Q101, G102, V103, G104, T106, E107, T108, P109, 
L110, Mill, K112, E113, D114, LI 17, R120, K121, Q124, R125, T127, L128, K131, 

25 E132, K133, K134, Y135, S136, P137, C138, A145, M148, R149, S152, L153, N156, 
Q158, E159, S160, L161, R162, S163, K164 and E165, with position numbering relative 
to that of the interferon-alpha 2a sequence identified herein as SEQ ID NO:47. 

The following residues were determined to have on average more than 50% 
fractional ASA of their side chain: D2, L3, P4, Q5, T6, H7, S8, L9, R12, R13, M16, A19, 

30 S25, F27, S28, K31, R33, H34, D35, G37, E41, G44, N45, Q46, Q48, K49, N65, K70, 
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A74, D77, E78, T79, D82, K83, T86, Y89, Q90, N93, D94, 1100, Q101, G102, G104, 
T106, E107, T108, P109, LI 10, El 13, Dl 14, LI 17, R120, K121, Q124, R125, L128, 
K131, E132, K134, P137, R149, E159, L161, R162, S163, K164 and E165, with position 
numbering relative to that of the interferon-alpha 2a sequence identified herein as SEQ ID 
5 NO:47. 

Surface exposure of residues corresponding to SEQ ID NO:l: 

Owing to an insertion of an amino acid after position 44 of the human interferon- 
alpha 2 subtypes - such as, for example, interferon-alpha 2b (SEQ ID NO:46) and 

10 interferon-alpha 2a (SEQ ID NO:47) — in many interferon alpha sequences, including all 
of the known human interferon alpha sequences (apart from the IFN-alpha 2 subtypes) 
and certain polypeptides of the invention, the position numbering of the surface-exposed 
residues will be shifted by one residue past position number 44 in, for example, the 
sequences shown in the alignment Figure 2, relative to the numbering of the sequence 

15 denoted hlFNalpha 2b (SEQ ID NO:46). 

Based on the above analysis, the following positions, numbered relative to SEQ ID 
NO:l, are considered to contain amino acid residues having more than 25% fractional 
ASA: positions 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 16, 19, 20, 22, 23, 24, 25, 26, 27, 28, 30, 
31, 33, 34, 35, 37, 40, 41, 42, 44, 46, 47, 49, 50, 51, 52, 59, 62, 63, 66, 69, 70, 71, 72, 74, 

20 75, 78, 79, 80, 81, 83, 84, 87, 90, 91, 94, 95, 97, 98, 100, 101, 102, 103, 104, 105, 107, 
108, 109, 110, 111, 112, 113, 114, 115, 118, 121, 122, 125, 126, 128, 129, 132, 133, 134, 
135, 136, 137, 138, 139, 146, 149, 150, 153, 154, 157, 159, 160, 161, 162, 163, 164, 165, 
and 166. 

Likewise, the following positions, again numbered relative to SEQ ID NO: 1, are 
25 considered contain amino acid residues having on average more than 50% fractional ASA 
of their side chain: 2, 3, 4, 5, 6, 7, 8, 9, 12, 13, 16, 19, 25, 27, 28, 31, 33, 34, 35, 37, 41, 
44, 46, 47, 49, 50, 66, 71, 75, 78, 79, 80, 83, 84, 87, 90, 91, 94, 95, 101, 102, 103, 105, 
107, 108, 109, 110, 111, 114, 115, 118, 121, 122, 125, 126, 129, 132, 133, 135, 138, 150, 
160, 162, 163, 164, 165, and 166. 
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EXAMPLE 2: ANTIVIRAL ACTIVITES OF INTERFERON-ALPHA 
POLYPEPTIDES 

Patients with chronic HCV infection have initial viral loads in the range of 10 4 -10 7 
copies of HCV RNA/ml. Upon treatment with IFN-alpha, the viral load characteristically 
undergoes two distinct log-linear phases of decline reflecting two distinct mechanisms 
(Figure IB). The initial drop in viral load occurs in about the first two days and is 
believed to be due to the reduction in rate of virus production by infected liver cells in the 
face of the IFN-alpha therapy. 

The major technical challenge with HCV is that the virus cannot be grown in vitro 
and has only recently been cultured in tractable animal models. There are, however, 
viruses that replicate in vitro which are considered to be useful surrogates for HCV viral 
replication. In vitro surrogate assays believed to be predictive of in vivo HCV antiviral 
activity include the assays described in the Materials and Methods section above, which 
measure the ability of test molecules to protect cells from the cytopathic effect (CPE) of 
viral infection, using Encephalomyocarditis virus (EMCV) in the human cell lines HuH7 
(hepatoma derived), HeLa (cervical carcinoma derived) and WISH (amniotic tissue 
derived). 

Table 7 below provides relative antiviral activities of various IFN-alpha polypeptides 
in representative CPE assays performed as described in the Materials and Methods 
section above. Significant differences in activity were determined by TwoWay ANOVA 
(factors: sample, experiment) (a = 0.05) followed by Bonferroni's post-hoc test for mean 
comparisons (a = 0.05) or Fisher's post-hoc test. The symbol (+/-) indicates antiviral 
activity which is statistically indistinguishable from that of the reference interferon; ND, 
not determined. 



Table 7 



Sample Name 


SEQ ID NO 


HuH7/EMCV 
Antiviral activity 

relative to 
hulFN-alpha 2b 


HeLa/ EMCV 
Antiviral activity 

relative to 
hulFN-alpha 2b 


B9X14EC4 


SEQ ID NO:1 


> 8x 


> 8x 


14EPI15 


SEQ ID NO:10 


> 8x 


> 8x 


14EPI16 


SEQ ID NO:11 


> 8x 


> 8x 


14EPI17 


SEQ ID NO:12 


> 8x 


> 8x 


14EPI18 


SEQ ID NO:13 


> 8x 


> 8x 
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Sample Name 


SEQ ID NO 


HuH7/EMCV 
Antiviral activity 

relative to 
huIFN-alpha 2b 


HeLa/ EMCV 
Antiviral activity 

relative to 
huIFN-alpha 2b 


14EPI19 


SEQ ID NO:14 


> 8x 


> 8x 


14EPI20 


SEQ ID NO:15 


> 8x 


> 8x 


14EPI21 


SEQ ID NO:16 


> 8x 


> 8x 


14EPI25 


SEQ ID NO:20 


> 8x 


> 8x 


14EPI32 


SEQ ID NO:27 


> 8x 


> 8x 


14EPI33 


SEQ ID NO:28 


> 8x 


> 8x 


14EPI34 


SEQ ID NO:29 


> 8x 


> 8x 


25EPI19 


SEQ ID NO:38 


> 8x 


> 8x 


14EPI22 


SEQ ID NO:17 


4x to 8x 


> 8x 


14EPI26 


SEQ ID NO:21 


4x to 8x 


> 8x 


14EPI29 


SEQ ID NO:24 


4x to 8x 


> 8x 


14EPI31 


SEQ ID NO:26 


4x to 8x 


> 8x 


14EPI23 


SEQ ID NO:18 


4x to 8x 


4x to 8x 


14EPI24 


SEQ ID NO:19 


4x to 8x 


4x to 8x 


25EPI08 


SEQ ID NO:36 


4x to 8x 


4x to 8x 


25EPI26 


SEQ ID NO:41 


4x to 8x 


4x to 8x 


14EPI42 


SEQ ID NO:35 


4x to 8x 


2x to 4x 


25EPI18 


SEQ ID NO:37 


4x to 8x 


2x to 4x 


25EPI25 


SEQ ID NO:40 


4x to 8x 


2x to 4x 


14EPI30 


SEQ ID NO:25 


2x to 4x 


4x to 8x 


25EPI28 


SEQ ID NO:43 


2x to 4x 


2x to 4x 


14EPI27 


SEQ ID NO:22 


2x to 4x 


2x to 4x 


14EPI35 


SEQ ID NO:30 


2x to 4x 


ND 


14EPI36 


SEQ ID NO:31 


2x to 4x 


ND 


14EPI37 


SEQ ID NO:32 


2x to 4x 


ND 


14EPI38 


SEQ ID NO:33 


2x to 4x 


ND 


25EPI20 


SEQ ID NO:39 


2x to 4x 


0.5x to 2x 


14EPI28 


SEQ ID NO:23 


0.5x to 2x 


0.5x to 2x 


25EPI27 


SEQ ID NO:42 


0.1 x to 0.5x 


0.1x to 0.5x 


IFN-alpha Con1 


SEQ ID NO:58 


2x to 4x 


4x to 8x 


IFN-alpha 2a 


SEQ ID NO:47 


0.5x to 2x 


0.5x to 2x 


IFN-alpha 2b 


SEQ ID NO:46 


1 (ref) 


1 (ref) 
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EXAMPLE 3: T H 1 ACTIVITES OF INTERFERON- ALPHA POLYPEPTIDES 

The T H 1 assay was designed to monitor the enhancement of differentiation of naive 
T cells into T H 1 cells by IFN-alpha as determined by the amount of IFN-gamma detected 
in the culture supernatants. Table 8 provides relative T H 1 activities of various IFN-alpha 
polypeptides assayed as described in the Materials and Methods section above. 



Table 8 



Sample 


SEQIDNO 


Fold over 
IFN-alpha 2a 


14Epi18 


SEQ ID NO:13 


> 30x 


25Epi19 


SEQ ID NO:38 


> 30x 


IFN-alpha Con1 


SEQ ID NO:58 


10x to 15x 


IFN-alpha 2a 


SEQ ID NO:47 


1 (ref) 



EXAMPLE 4: ANTIPROLIFERATIVE ACTIVITIES OF INTERFERON-ALPHA 
POLYPEPTIDES 

IFN-alpha inhibits proliferation of many cell types, although the antiproliferative 
effects often occur at higher doses than are required for the antiviral response. Daudi 
cells are a human derived EVB -transformed B cell line that is IFN-alpha sensitive. This 
IFN-alpha responsive cell line serves as a useful probe of the antiproliferative effects of 
the IFN-alpha polypeptides of the invention. Furthermore, antiproliferative activity of 
IFN-alpha on megakaryocytes and neutrophils at high dose is believed to contribute to 
thrombocytopenia and neutropenia, respectively. The Daudi antiproliferation assay may 
serve as a useful surrogate assay for antiproliferative effects on these other lymphoid cell 
types. 

Table 9 provides relative antiproliferative activities of various IFN-alpha 
polypeptides assayed as described in the Materials and Methods section above, expressed 
as fold antiproliferative activity relative to huIFN-alpha 2a ( EC 50 huIFN-alpha 2a / EC 50 
sample). Under the assay conditions employed, antiproliferative activities at least O.Olx 
of the reference interferon were detectable. 



168 



WO 2005/113592 



PCT/US2005/017471 



Table 9 



sample name 


ccn in MO- 


Fold activity 
vs. IFNa-2a 


B9X14EC4 


SEQIDNO:1 


0.8x to 0.5x 


14Epi15 


SEQ ID NO:10 


0.25x to 0.1 x 


14Epi17 


SEQ ID NO:12 


0.25x to 0.1 x 


14Epi18 


SEQ ID NO:13 


0.8x to 0.5x 


14Epi19 


SEQ IDNO:14 


0.5x to 0.25x 


14Epi20 


SEQ ID NO:15 


0.5x to 0.25x 


14Epi21 


SEQ ID NO:16 


0.25x to 0.1 x 


14Epi22 


SEQIDNO:17 


0.25x to 0.1 x 


14Epi23 


SEQ ID NO:18 


0.25x to 0.1 x 


14Epi24 


SEQ IDNO:19 


0.5x to 0.25x 


14Epi25 


SEQ ID NO:20 


0.5x to 0.25x 


14Epi26 


SEQ ID NO:21 


0.5x to 0.25x 


14Epi27 


SEQ ID NO:22 


0.5x to 0.25x 


14Epi28 


SEQ ID NO:23 


0.8x to 0.5x 


14Epi29 


SEQ ID NO:24 


0.25x to 0.1 x 


14Epi30 


SEQ ID NO:25 


0.5x to 0.25x 


14Epi31 


SEQ ID NO:26 


0.5x to 0.25x 


14Epi32 


SEQ ID NO:27 


1 .2x to 0.8x 


14Epi33 


SEQ ID NO:28 


0.8x to 0.5x 


14Epi34 


SEQ ID NO:29 


0.8x to 0.5x 


14Epi35 


SEQ ID NO:30 


0.8x to 0.5x 


14Epi36 


SEQ ID NO:31 


0.5x to 0.25x 


14Epi37 


SEQ ID NO:32 


0.8x to 0.5x 


14Epi38 


SEQ ID NO:33 


0.8x to 0.5x 


14Epi42 


SEQ ID NO:35 


0.5x to 0.25x 


25Epi08 


SEQ ID NO:36 


0.5x to 0.25x 


25Epi18 


SEQ ID NO:37 


0.25x to 0.1 x 


25Epi19 


SEQ ID NO:38 


0.8x to 0.5x 


25Epi20 


SEQ ID NO:39 


0.25x to 0.1 x 


25Epi25 


SEQ ID NO:40 


0.25x to 0.1 x 


25Epi26 


SEQ ID NO:41 


0.25x to 0.1 x 


25Epi27 


SEQ ID NO:42 


0.5x to 0.25x 


25Epi28 


SEQ ID NO:43 


0.5x to 0.25x 


25Epi29 


SEQ ID NO:44 


0.8x to 0.5x 
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sample name 


SEQ ID NO: 


Fold activity 
vs. IFNa-2a 


IFN-alpha Con1 


SEQ ID NO:58 


1 .2x to 0.8x 


IFN-alpha 2a 


SEQ ID NO:47 


1x (ref) 



EXAMPLE 5: PEGYLATION OF INTERFERON- ALPHA POLYPEPTIDES 

The following provides exemplary procedures for preparing conjugates of the invention. 

Cys-PEGylation 

5 A polypeptide of the invention which contains a free cysteine may be cysteine- 

PEGylated as follows. The polypeptide is first partially reduced with an equimolar 
concentration of TCEP (Triscarboxyethylphosphine) at 4°C for 30 min in 50 mM MES, 
100 mM NaCl, pH 6.0. The reduced polypeptide is then reacted with a 4 fold molar 
excess of mPEG-MAL reagent (with a PEG moiety such as a 20 kDa or 30 kDa linear 

10 mPEG, or a 40 kDa branched mPEG2) for lh at 4°C under the same conditions. The 

PEGylated reaction mixture is loaded on to a SP-Sepharose HP column equilibrated with 
50mM MES, pH 6.0, lOOmM NaCl. After a 10 CV (column volume) wash step a 
gradient from 0-600 mM NaCl is applied to fractionate the PEGylated and unPEGylated 
fractions. Fractions are collected and aliquots are analyzed by SDS-PAGE. Fractions 

15 containing monoPEGylated species are pooled and formulated for assays for interferon- 
alpha activity as described above. 

N-terminal PEGylation 

N-terminal PEGylation with mPEG-butyraldehyde (having a PEG moiety such 
as a 20 kDa or 30 kDa linear mPEG, or a 40 kDa branched mPEG2) may be performed at 

20 4°C using a 5 fold molar excess of PEG reagent: polypeptide for 4-8 h in 50 mM MES, pH 
5.5, 100 mM NaCl, and 20 mM sodium cyanoborohydride, or in 50 mM sodium acetate, 
pH 5.5, 100 mM NaCl and 20 mM sodium cyanoborohydride. Conjugates are isolated by 
chromatography over a SP-Sepharose HP column equilibrated in 50 mM MES pH 5.5, 
100 mM NaCl, using a gradient from 100-600 mM NaCl in 50mM MES pH 5.5. 

25 Fractions containing monoPEGylated species are pooled and formulated for assays for 
interferon-alpha activity as described above, and may be further characterized by, for 
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example, using analytical HPLC, amino acid analysis and/or MALDI-TOF mass 
spectrometry. 

Lysine PEGylation 

5 In general, lysine-PEGylation was performed using a procedure essentially as 

described by Foser et al. (2003) Protein Expression & Purification 30:78-87. Briefly, the 
polypeptide was reacted with 40 kDa mPEG2-NHS at about a 3:1 to 5:1 molar ratio of 
mPEG2-NHS polypeptide in 50 mM borate buffer, pH 9.0, at 4°C for about 1 hr to 
overnight. Conjugates were isolated by cation exchange chromatography. Fractions 

10 containing monoPEGylated species were pooled and formulated for interferon-alpha 
activity assays as described above. MonoPEGylated IFN-alpha conjugates prepared 
according to the above procedure exhibited antiviral activity in the HeLa-EMCV assay, 
albeit lower than that of the parent unPEGylated polypeptides. 

In some instances, the monoPEGylated pool obtained as described above was further 

15 analyzed by cation exchange HPLC using a Polysulfoethyl A column (2. ImmID x 200 
mmL, 5m, 1000A 0 , from PolyLC) to characterize the PEG-isomer profile. The mobile 
phases were buffer A: 5 mM KH2PO4, 40% 1-propanol, 1% diethylene glycol, pH 3.0 
and bufferB: same as A with 200 mM NaCl. The PEG positional isomers were separated 
using a gradient from 50%-75% mobile phase B over 80 min at 0.1 ml/min, 20°C with 

20 detection at 214 nm. 

Figure 8 shows the separation by cation exchange HPLC of the monoPEGylated 
pool obtained from the reaction of interferon-alpha polypeptide B9xl4EC4 with 40 kDa 
mPEG2-NHS according to the above procedure. The peaks were collected and analyzed 
on SDS-PAGE (Figure 9). Lanes 2 and 9 of Fig. 9 shows that the sample prior to 

25 application onto the HPLC column contains predominantly monoPEGylated protein. 

Peaks 1-4 (lanes 3-6 of Fig. 9) correspond to the small proportion of diPEGylated IFN in 
the monoPEGylated pool. Minor peak 5 (lane 7 of Fig. 9) contains approximately equal 
proportions of diPEGylated, monoPEGylated and a lower molecular weight species. 
Peaks 6-11 (lanes 10-15 of Fig. 9) correspond to positional isomers of the 

30 monoPEGylated IFN polypeptide. 
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Polypeptide 14epil8 (SEQ ID NO: 13) was lysine-PEGylated as follows. The eluate 
of the Q-Sepharose chromatography step (obtained according to the Materials and 
Methods section above) was concentrated to c < 2 mg/ml. For PEGylation the buffer was 
exchanged to 50 mM sodium borate, pH 9.0. The polypeptide was reacted with 40 kDa 

5 mPEG2-NHS essentially as described by Foser et al (2003; Protein Expression & 

Purification 30:78-87) at a molar ratio of PEG reagent over protein of 4:1 or 5: 1. The 
PEGylation reaction was allowed to proceed at 4°C for 1.5 h. MonoPEGylated 14epil8 
was purified from the reaction mixture by cation exchange chromatography. Elution was 
performed in three steps using increasing salt concentrations up to 1 M NaCl to separate 

10 oligoPEGylated, monoPEGylated and non-PEGylated species. The monoPEGylated 

14epil8 peak was formulated at a concentration of 0.33 mg/ml for further analysis. The 
monoPEGylated 14epil8 composition so obtained yielded a population of positional 
isomers predominantly monoPEGylated at Lysl22 and Lysl35. 

Polypeptide 25epil9 (SEQ ID NO:38) was lysine-PEGylated as follows. The eluate 

15 of the Q-Sepharose chromatography step (obtained according to the Materials and 

Methods section above) was concentrated to c < 2 mg/ml. For PEGylation the buffer was 
exchanged to 50 mM sodium borate, pH 9.0. The polypeptide was reacted with 40 kDa 
mPEG2-NHS essentially as described by Foser et al (2003; Protein Expression & 
Purification 30:78-87) at a molar ratio of PEG reagent over protein of 4: 1. The 

20 PEGylation reaction was allowed to proceed at 4°C overnight. MonoPEGylated 25epil9 
was purified from the reaction mixture by cation exchange chromatography. Elution was 
performed in three steps using increasing salt concentrations up to 750 mM NaCl to 
separate oligoPEGylated, monoPEGylated and non-PEGylated species. The 
monoPEGylated 25epil9 peak was concentrated in the elution buffer to < 2 mg/ml and 

25 formulated for further analysis. The monoPEGylated 25epil9 composition so obtained 
yielded a population of positional isomers predominantly monoPEGylated at Lys31, 
Lysl22 and Lysl35. 

MonoPEGylated 14epil8 and monoPEGylated 25epil9 prepared as described above 
both exhibited greater than 10-fold higher antiviral activity than monoPEGylated IFN- 
30 alpha 2a in the HeLa-EMCV antiviral assay. 
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EXAMPLE 6: DIFFERENTIAL SCANNING CALORIMETRY 

Differential scanning calorimetry (DSC) experiments were performed to estimate the 
stability of some polypeptides of the invention. Samples of protein at concentrations 
from 100 to 325 |ag/ml were buffer-exchanged into phosphate buffered saline (PBS) and 
5 subjected to DSC using a MicroCal VP-DSC (MicroCal, Northampton, MA). The 
melting temperature (T m ) was estimated from the plot of heat capacity (C p ) vs 
temperature. Estimated T m s are: B9X14EC4 (SEQ ID NO:l) 58.4°C; 14epil6 (SEQ ID 
NO: 11) 64.2°C; and 14epil8 (SEQ ID NO: 13) 59.5°C. The substitution of a buried 
phenylalanine residue at position 48 (in SEQ ID NO: 1) for either a leucine (in SEQ ID 
10 NO: 1 1) or an alanine (in SEQ ID NO: 13) appears to have no detrimental effect on the 
stability of the polypeptide. 

EXAMPLE 7: IN VIVO ASSAYS 

Measurement of pharmacokinetic (PK) and pharmacodynamic (PD) profiles of a 
polypeptide or conjugate of the invention 

15 Measurement of biological or serum half-life may be carried out in a number of 

ways described in the literature. For example, biological half-life may be determined 
using an ELISA method to detect serum levels of interferon-alpha after e.g. subcutaneous 
or intramuscular administration. Use of an ELISA method to determine the 
pharmacokinetics of interferon-alpha administered subcutaneously is e.g. described by 

20 Rostaing et al. (1998), J. Am. Soc. Nephrol. 9(12): 2344-48. Merimsky et al. (1991), 
Cancer Chemother. Pharmacol. 27(5); 406-8, describe the determination of the serum 
level of an interferon-alpha administered intramuscularly. 

For example, the pharmacokinetic (PK) and pharmacodynamic (PD) profiles of a 
conjugate of the invention may be studied in cynomolgus monkeys (Macaca fascicularis) 

25 according to the following procedure. Three to four year old female animals with an 

average weight of 5-6 kg are used in the study. Animals are acclimatized for a minimum 
of 6 weeks between arrival and start of treatment with a minimum period of 3 weeks 
acclimatization to the study housing conditions. They are housed singly in stainless-steel 
mesh cages (at least 540 x 810 x 760 mm) and are kept on an expanded complete 

30 commercial primate diet (100 grams/animal/day) analyzed for the absence of chemical 
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and bacterial contaminants. In addition, animals receive fruit daily (apple or banana). 
Animals are fasted before all procedures requiring anesthesia. ' 

Three days before study initiation and on the day of study initiation prior to 
administration of the compounds, blood samples are collected from all animals. 
5 Throughout the study, blood samples (1 ml of whole blood) are withdrawn from a 

femoral or cephalic/saphenous vessel of the unanesthetised manually restrained animal. 
The normal feeding regime is maintained during collection. Samples are collected in 
tubes without anticoagulant, followed by centrifugation at 3000 rpm (about 1760 g) for 
10 minutes at 4°C. The serum samples are then separated into two aliquots 

10 (approximately 200 \i\ each) and stored at -20°C. 

Prior to initiation of the study, the conjugate of the invention is formulated in 20 mM 
sodium acetate (pH 6.0) and 140 mM sodium chloride. At study initiation, the conjugate 
is administered once per animal at either 30 meg conjugate/kg animal weight (four 
animals), or 300 mcg/kg (four animals), using a sterile syringe and needle introduced 

15 subcutaneously after local disinfection with an aqueous solution of ethyl alcohol. Blood 
samples are collected at the following time points after administration of the conjugate: 4 
hours, 8 hours, 12 hours, 24 hours, 48 hours, 72 hours, 144 hours, 240 hours, 336 hours, 
456 hours, 552 hours, 600 hours, 624 hours, 648 hours, 672 hours, 696 hours, 720 hours 
and 744 hours. 

20 The animals are observed daily to detect any clinical signs or reaction to treatment. 

Injection sites are visually assessed daily in order to evaluate local tolerance. A detailed 
clinical examination is performed before the initiation of treatment and at study 
termination. 

The concentration of conjugate in the isolated serum samples are determined after 
25 completion of the study, using an ELISA assay. The in vivo half-life of the conjugate in 
cynomolgus monkeys is then determined based on the serum concentration of the 
compounds at the specified time points. 

The pharmacodynamic response in cynomolgus monkeys to treatment with a 
conjugate of the invention is evaluated by measuring the levels of 2\5'-oligoadenylate 
30 synthetase and neopterin in each of the serum samples, using a commercially available 
RIA (Eiken Chemical Co., Tokyo) and a commercially available ELISA (IBL Immuno 
Biological Laboratories, Hamburg.), respectively. Based on the measured concentrations 
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of these two biomarkers for interferon activity in the serum samples, the area under the 
curve (AUC) is determined to quantify the in vivo activity of each compound at each of 
the two dosages. 

Determining in vitro immunogenicity 

Reduced immunogenicity of a polypeptide or conjugate of the invention relative to a 
reference molecule can be determined by use of an ELIS A method measuring the 
immunoreactivity of the molecule relative to a reference molecule or preparation, 
typically a known interferon-alpha protein. The ELISA method is based on antibodies 
from patients treated with the reference protein. The immunogenicity is considered to be 
reduced when the polypeptide or conjugate of the invention has a statistically significant 
lower response in the assay than the reference molecule or preparation. 

While the foregoing invention has been described in some detail for purposes of 
clarity and understanding, it will be clear to one skilled in the art from a reading of this 
disclosure that various changes in form and detail can be made without departing from the 
true scope of the invention. It is understood that the examples and embodiments 
described herein are for illustrative purposes only and that various modifications or 
changes in light thereof will be suggested to persons skilled in the art and are to be 
included within the spirit and purview of this application and scope of the appended 
claims. For example, all the techniques and apparatus described above may be used in 
various combinations. All publications, patents, patent applications, and/or other 
documents cited in this application are incorporated herein by reference in their entirety 
for all purposes to the same extent as if each individual publication, patent, patent 
application, and/or other document were individually indicated to be incorporated herein 
by reference in its entirety for all purposes. 
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WHAT IS CLAIMED IS : 

L An isolated or recombinant polypeptide comprising a sequence which 

(a) differs in 1 to 16 amino acid positions from SEQ ID NO: 1 and 

(b) comprises one or more substitution relative to SEQ ID NO: 1 selected from the 
group consisting of F48A/L; V51P; F55A; F65A; F68P; LI 1 1A; and VI 14P; 

which polypeptide exhibits antiviral activity. 

2. The polypeptide of claim 1, comprising a sequence which differs from SEQ ID 
NO:l in 1 to 10 amino acid positions. 

3. The polypeptide of claim 1, comprising a sequence which differs from SEQ ID 
NO:l in 1 to 8 amino acid positions. 

4. The polypeptide of claim 1, further comprising one or more substitution selected 
from the group consisting of H47Q; V51A; Q52P/E; A53T; F55S; L56V; F57L; Y58H; 
M61I; N113K; V114E; andE160D. 

5. The polypeptide of claim 1, comprising a sequence selected from the group 
consisting of SEQ ID NO:2-35. 

6. The polypeptide of claim 1, which sequence comprises the substitution F48A/L 
relative to SEQ ID NO: 1 . 

7. The polypeptide of claim 6, comprising a sequence selected from the group 
consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 18, SEQ ED NO: 19, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:30, SEQ ID NO:34 and SEQ ID NO:35, 
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8. An isolated or recombinant polypeptide comprising a sequence which 

(a) differs in 0 to 16 amino acid positions from SEQ ID NO: 13 and 

(b) comprises one or more of: Ala or Leu at position 48; Pro at position 51; Ala at 
position 55; Ala at position 65; Pro at position 68; Ala at position 111; and Pro at position 
114; 

which polypeptide exhibits antiviral activity. 

9. The polypeptide of claim 8, which sequence comprises Ala or Leu at position 48. 

10. The polypeptide of claim 8, comprising a sequence which differs from SEQ ID 
NO: 13 in 0 to 8 amino acid positions. 

11. An isolated or recombinant polypeptide comprising the sequence SEQ ID NO: 13, 
optionally further comprising a methionine at the N-terminus, which polypeptide exhibits 
antiviral activity. 

12. An isolated or recombinant polypeptide comprising a sequence which 

(a) differs in 1 to 16 amino acid positions from SEQ ID NO:36 and 

(b) comprises one or more substitution relative to SEQ ID NO:36 selected from the 
group consisting of M21A, I24P, F48A/L, T51P, S55A, F65A, F68P, F90A, M93P, 

LI 1 1 A, VI 14P, F124A, I127P, and E160D; 

which polypeptide exhibits antiviral activity. 

13. The polypeptide of claim 12, comprising a sequence which differs from SEQ ID 
NO:36 in 1 to 10 amino acid positions. 

14. The polypeptide of claim 12, comprising a sequence which differs from SEQ ID 
NO: 36 in 1 to 8 amino acid positions. 

15. The polypeptide of claim 12, further comprising one or more substitution selected 
from the group consisting of P26L, H47Q, T51V, S55P/F, V56L, H58Y, L60M, F90Y, 
M93L, N95D, N113K, V114E, R125Q, T132K, and L154F. 
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16. The polypeptide of claim 12, comprising a sequence selected from the group 
consisting of SEQ ID NO: 37-44. 

17. The polypeptide of claim 12, which sequence comprises the substitution F48A/L 
relative to SEQ ED NO:36. 

18. The polypeptide of claim 17, comprising a sequence selected from the group 
consisting of SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID 
NO:41, SEQ ID NO:42, and SEQ ID NO:43. 

19. The polypeptide of claim 12, comprising the sequence SEQ ID NO:44. 

20. An isolated or recombinant polypeptide comprising a sequence which 

(a) differs in 0 to 16 amino acid positions from SEQ ID NO:38 and 

(b) comprises one or more of: Ala at position 21; Pro at position 24; Ala or Leu at 
position 48; Pro at position 51; Ala at position 55; Ala at position 65; Pro at position 68; 
Ala at position 90; Pro at position 93; Ala at position 111; Pro at position 1 14; Ala at 
position 124; Pro at position 127; and Glu at position 160; 

which polypeptide exhibits antiviral activity. 

21. The polypeptide of claim 20, which sequence comprises Ala or Leu at position 48. 

22. The polypeptide of claim 20, comprising a sequence which differs from SEQ ID 
NO: 38 in 0 to 8 amino acid positions. 

23. An isolated or recombinant polypeptide comprising the sequence SEQ ID NO:38, 
optionally further comprising a methionine at the N-terminus, which polypeptide exhibits 
antiviral activity. 

24. The polypeptide of claim 1 to 23, wherein the antiviral activity of the polypeptide is 
equal to or greater than the antiviral activity of huIFN-alpha 2b or huIFN-alpha 2a. 
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25. The polypeptide of claim 24, wherein the antiviral activity of the polypeptide is at 
least two-fold greater than the antiviral activity of huIFN-alpha 2b or huIFN-alpha 2a. 

26. The polypeptide of claim 1 to 23, wherein the polypeptide exhibits a ratio of antiviral 
activity/antiproliferative activity at least two-fold greater than the ratio of antiviral 
activity/antiproliferative activity exhibited by huIFN-alpha 2b or huIFN-alpha 2a. 

27. A conjugate comprising 

(a) a polypeptide of claim 1 to 26, and 

(b) a non-polypeptide moiety covalently attached to the polypeptide. 

28. The conjugate of claim 27, comprising at least two non-polypeptide moieties. 

29. The conjugate of claim 27, comprising a non-polypeptide moiety covalently attached 
to a cysteine residue. 

30. The conjugate of claim 27, comprising a non-polypeptide moiety covalently attached 
to a lysine residue or to the N-terminal amino group. 

31. The conjugate of claim 27, comprising a non-polypeptide moiety covalently attached 
to a lysine residue. 

32. The conjugate of claim 27, comprising a non-polypeptide moiety attached to the N- 
terminal amino group. 

33. The conjugate of claim 27, wherein the non-polypeptide moiety is a polymer. 

34. The conjugate of claim 33, wherein the polymer is a polyethylene glycol. 

35. The conjugate of claim 34, wherein the polyethylene glycol is a 40 kDa mPEG2 
moiety covalently attached to a lysine residue. 
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36. The conjugate of claim 27, wherein the non-polypeptide moiety is a sugar. 

37. The conjugate of claim 36, wherein the sugar is attached to an N-glycosylation site 
of the polypeptide. 

38. A conjugate comprising 

(a) a polypeptide comprising the sequence SEQ ID NO: 13, optionally further 
comprising a methionine at the N-terminus; and 

(b) a PEG moiety covalently attached to a lysine residue of the polypeptide, 
which conjugate exhibits antiviral activity. 

39. The conjugate of claim 38, wherein the PEG moiety is a 40 kDa mPEG2 moiety. 

40. The conjugate of claim 39, wherein the 40 kDa mPEG2 moiety is covalently 
attached to a lysine residue selected from Lysl22 and Lysl35. 

41. A conjugate comprising 

(a) a polypeptide comprising the sequence SEQ ID NO:38, optionally further 
comprising a methionine at the N-terminus; and 

(b) a PEG moiety covalently attached to a lysine residue of the polypeptide, 
which conjugate exhibits antiviral activity. 

42. The conjugate of claim 41, wherein the PEG moiety is a 40 kDa mPEG2 moiety. 

43. The conjugate of claim 42, wherein the 40 kDa mPEG2 moiety is covalently 
attached to a lysine residue selected from Lys31, Lysl22, and Lysl35. 

44. A composition comprising 

(a) a conjugate comprising a polypeptide comprising the sequence SEQ ID NO: 13, 
optionally further comprising a methionine at the N-terminus, and a 40 kDa mPEG2 
moiety covalently attached to Lysl22; and 
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(b) a conjugate comprising a polypeptide comprising the sequence SEQ ID NO: 13, 
optionally further comprising a methionine at the N-terminus, and a 40 kDa mPEG2 
moiety covalently attached to Lysl35; 

which composition exhibits antiviral activity. 

45. A composition comprising 

(a) a conjugate comprising a polypeptide comprising the sequence SEQ ID NO:38, 
optionally further comprising a methionine at the N-terminus, and a 40 kDa mPEG2 
moiety covalently attached to Lys3 1 ; 

(b) a conjugate comprising a polypeptide comprising the sequence SEQ ID NO:38, 
optionally further comprising a methionine at the N-terminus, and a 40 kDa mPEG2 
moiety covalently attached to Lysl22; and 

(b) a conjugate comprising a polypeptide comprising the sequence SEQ ID NO:38, 
optionally further comprising a methionine at the N-terminus, and a 40 kDa mPEG2 
moiety covalently attached to Lysl35; 

which composition exhibits antiviral activity. 

46. A composition comprising the polypeptide of claim 1 to 26 and a pharmaceutically 
acceptable excipient. 

47. A composition comprising the conjugate of claim 27 to 43 and a pharmaceutically 
acceptable excipient. 

48. The composition of claim 44 to 45 further comprising a pharmaceutically acceptable 
excipient. 

49. An isolated or recombinant polynucleotide comprising a nucleic acid sequence 
which encodes the polypeptide of claim 1 to 26. 

50. The polynucleotide of claim 49, comprising a nucleic acid sequence which encodes a 
polypeptide comprising the sequence SEQ ID NO: 13. 
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5 1 . The polynucleotide of claim 50, comprising a nucleic acid sequence selected from 
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, and SEQ ID NO:89. 

52. The polynucleotide of claim 49, comprising a nucleic acid sequence which encodes a 
polypeptide comprising the sequence SEQ ID NO: 38. 

53. The polynucleotide of claim 52, comprising a nucleic acid sequence selected from 
SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, and SEQ ID NO:90. 

54. A host cell comprising the polynucleotide of claim 49. 

55. A vector comprising the polynucleotide of claim 49. 

56. The vector of claim 55, which is an expression vector comprising the polynucleotide 
operably linked to a promoter. 

57. A host cell comprising the vector of claim 56. 

58. A composition comprising the polynucleotide of claim 49 and an excipient. 

59. A method for preparing a polypeptide of claim 1 to 26, the method comprising: 
providing a culture comprising a host cell, the host cell comprising an expression 

vector comprising a promoter operably linked to a polynucleotide, the polynucleotide 
comprising a nucleic acid sequence which encodes the polypeptide, 

culturing the culture under conditions which permit expression of the polypeptide, 

and 

recovering the polypeptide. 

60. The method of claim 59, wherein the host cell is a glycosylating host cell. 

61. The method of claim 59, wherein the host cell is a bacterial host cell. 
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62. The method of claim 59, wherein the bacterial host cell is E. coli. 

63. The method of claim 62, wherein the step of recovering the polypeptide comprises: 

(a) isolating inclusion bodies (IB) comprising the polypeptide; 

(b) solubilizing the IB, thereby obtaining unfolded polypeptide; 

(c) refolding the unfolded polypeptide, thereby obtaining refolded polypeptide; and 

(d) purifying the refolded polypeptide. 

64. A method for preparing a conjugate, the method comprising 

(i) providing the polypeptide of claim 1 to 26, and 

(ii) attaching at least one non-polypeptide moiety to the polypeptide, wherein the 
resulting conjugate exhibits antiviral activity. 

65. The method of claim 64, wherein the step of providing the polypeptide comprises: 
providing a culture comprising a host cell, the host cell comprising an expression 

vector comprising a promoter operably linked to a polynucleotide, the polynucleotide 
comprising a nucleic acid sequence which encodes the polypeptide, 

culturing the culture under conditions which permit expression of the polypeptide, 

and 

recovering the polypeptide. 

66. The method of claim 65, wherein the host cell is a glycosylating host cell or a 
bacterial host cell. 

67. The method of claim 66, wherein the bacterial host cell is E. coli. 

68. The method of claim 67, wherein the step of recovering the polypeptide comprises: 

(a) isolating inclusion bodies (IB) comprising the polypeptide; 

(b) solubilizing the IB, thereby obtaining unfolded polypeptide; 

(c) refolding the unfolded polypeptide, thereby obtaining refolded polypeptide; and 

(d) purifying the refolded polypeptide. 
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69. The method of claim 64, wherein the step of attaching the at least one non- 
polypeptide moiety to the polypeptide comprises: 

reacting the polypeptide with 40 kDa mPEG2-NHS at a molar ratio of mPEG2- 
NHS polypeptide of about 3:1 to 5:1 at pH 9 and 4°C for 1 hour to overnight. 

70. A method for reducing the number of copies of a virus in cells infected with the 
virus, the method comprising: administering the polypeptide of claim 1 to 26, the 
conjugate of claim 27 to 43, or the composition of claim 44 to 48 to the cells in an amount 
effective to reduce the number of copies of the virus in the cells, thereby reducing the 
number of copies of the virus in said cells. 

71. A method for reducing the level of HCV RNA in the serum of a patient infected with 
HCV, comprising administering to the patient the polypeptide of claim 1 to 26, the 
conjugate of claim 27 to 43, or the composition of claim 44 to 48 in an amount effective 
to reduce the level of HCV RNA compared to the HCV RNA level present prior to the 
start of treatment. 

72. A method for reducing the level of HB V DNA in serum of a patient infected with 
HBV, comprising administering to the patient the polypeptide of claim 1 to 26, the 
conjugate of claim 27 to 43, or the composition of claim 44 to 48 in an amount effective 
to reduce the level of HBV DNA compared to the HBV DNA level present prior to the 
start of treatment. 

73. A method for reducing the level of HIV RNA in serum of a patient infected with 
HIV, comprising administering to the patient the polypeptide of claim 1 to 26, the 
conjugate of claim 27 to 43, or the composition of claim 44 to 48 in an amount effective 
to reduce the level of HIV RNA compared to the HIV RNA level present prior to the start 
of treatment. 

74. The polypeptide of claim 1 to 26, the conjugate of claim 27 to 43, or the composition 
of claim 44 to 48 for use as a medicament. 
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75. Use of the polypeptide of claim 1 to 26, the conjugate of claim 27 to 43, or the 
composition of claim 44 to 48 for the manufacture of a medicament for use in the 
treatment of illnesses. 

76. Use of the polypeptide of claim 1 to 26, the conjugate of claim 27 to 43, or the 
composition of claim 44 to 48 for the manufacture of a medicament for use in the 
treatment of viral disease. 

77. Use of the polypeptide of claim 1 to 26, the conjugate of claim 27 to 43, or the 
composition of claim 44 to 48 for the manufacture of a medicament for reducing the 
number of copies of a virus in cells infected with the virus. 

78. Use of the polypeptide of claim 1 to 26, the conjugate of claim 27 to 43, or the 
composition of claim 44 to 48 for the manufacture of a medicament for reducing the 
level of a virus in the serum of a patient infected with the virus. 

79. Use according to any of claims 61-63, wherein the virus is HCV, HBV, or HIV. 
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<110> Maxygen, Inc. 

F. Hof f mann-LaRoche AG 

<120> INTERFERON-ALPHA POLYPEPTIDES AND 
CONJUGATES 

<130> 0280.310WO 

<150> US 60/572 , 504 
<151> 2004-05-19 

<160> 90 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct B9xl4EC4 
<400> 1 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

1 5 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Val Gin Ala lie Phe Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 2 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi07 
<400> 2 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 
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1 
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10 










15 




Leu 


Leu. 


Ala 


Gin 


Met 


Arg 


Arcr 


He 


Ser 


Leu 


Phe 


Ser 


Cvs 


Leu 


Lys 


Asp 








2 0 










25 










3 0 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Glv 


Asn 


His 


Phe 






3 5 










40 










45 








Gin 


Lys 


Val 


Gin 


Ala 


He 


Pro 


Leu 


Phe 


Tvr 
j 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trn 


Asp 


Glu 


Thr 


65 










70 










75 










80 




JjSU 


ril n 


j_iy s 


Phe 




He 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 






ri± ex. 


^y b 


Val 


Met 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Val 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 3 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi08 



<400> 3 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg He Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Ala Pro Ala He Ser Val Leu His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr He Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser He Leu Ala Val Arg Lys Tyr Phe Gin Arg He Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu He Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 4 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Synthetic construct 14epi09 
<400> 4 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Ala Glu Thr lie Pro Val Leu His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 5 
<211> 166 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic 
<400> 5 

Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Ala Glu 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



construct 14epi: 

Gin Thr His Ser 
5 

Met Arg Arg lie 

Arg Phe Pro Gin 
40 

Ala lie Ser Val 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



0 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn His Phe 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 
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<210> 6 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epill 



<400> 6 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Ala Gin Ala lie Ser Ala Phe His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 7 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil2d 



<400> 7 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Gin Ala lie Phe Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Asp 
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145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 8 
<211> 166 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic construct 14epil3 



<400> 8 

Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Arg Phe Pro Gin 
40 

Ala lie Phe Leu 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Phe Tyr Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn Gin Phe 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 9 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil4 



<400> 9 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


His 


Arg 


Arg 


Thr 


Met 


Met 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 
20 


Met 


Arg 


Arg 


He 


Ser 
25 


Leu 


Phe 


Ser 


Cys 


Leu 
30 


Lys 


Asp 


Arg 


His 


Asp 
35 


Phe 


Arg 


Phe 


Pro 


Gin 
40 


Glu 


Glu 


Phe 


Asp 


Gly 
45 


Asn 


His 


Phe 


Gin 


Lys 
50 


Val 


Gin 


Ala 


lie 


Phe 
55 


Leu 


Phe 


Tyr 


Glu 


Met 
60 


He 


Gin 


Gin 


Thr 


Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 
85 


Tyr 


He 


Glu 


Leu 


Phe 
90 


Gin 


Gin 


Met 


Asn 


Asp 
95 


Leu 


Glu 


Ala 


Cys 


val 

100 


Met 


Gin 


Glu 


Val 


Gly 
105 


Val 


Glu 


Glu 


Thr 


Pro 
110 


Leu 


Met 
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Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 10 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil5 
<400> 10 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Gin Ala lie Phe Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 11 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil6 
<400> 11 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Gin Ala lie Ala Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
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65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 12 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil7 



<400> 12 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


His 


Arg 


Arg 


Thr 


Met 


Met 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Arg 


Arg 


He 


Ser 


Leu 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Gly 


Asn 


His 


Leu 






35 










40 










45 








Gin 


Lys 


Val 


Gin 


Ala 


He 


Ser 


Leu 


Phe 


Tyr 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Tyr 


He 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Met 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Val 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 13 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil8 
<400> 13 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg He Ser Leu Phe Ser Cys Leu Lys Asp 
20 25 30 
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Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala lie Phe Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 14 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epil9 



<400> 14 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


His 


Arg 


Arg 


Thr 


Met 


Met 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Arg 


Arg 


He 


Ser 


Leu 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Gly 


Asn 


His 


Ala 






35 










40 










45 








Gin 


Lys 


Val 


Gin 


Ala 


He 


Ala 


Leu 


Phe 


Tyr 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Tyr 


lie 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Met 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Val 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 15 

<211> 166 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi20 
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<400> 15 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Arg Phe Pro Gin 
40 

Ala lie Ser Leu 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Phe Tyr Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn His Ala 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg He Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 16 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi21 



<400> 16 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu Gly His Arg Arg 


Thr 


Met 


Met 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Arg 


Arg 


He 


Ser 


Leu 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Gly Asn 


His 


Phe 






35 










40 










45 








Gin 


Lys 


Val 


Gin 


Ala 


He 


Ala 


Leu 


Phe 


Tyr 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Tyr 


He 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Met 


Gin 


Glu 


Val 


Gly Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Val 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


lie 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 17 
<211> 166 
<212> PRT 
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10/43 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi22 
<400> 17 

Cys Asp Leu Pro Gin Thr His. Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 , 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 18 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi23 
<400> 18 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin GLn Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Ala Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 
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<210> 19 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi24 
<400> 19 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin 'Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 20 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi25 
<400> 20 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Ala Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 
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130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 21 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi26 
<400> 21 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 22 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi27 
<400> 22 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Ala Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 
85 90 95 
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Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Gly val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Thr Pro Ala Met 
110 

Gin Arg He Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 23 
<211> 166 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic construct 14epi28 



<400> 23 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Gin 
50 

Ala Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Lys Glu Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu He 
145 

Ser Leu Arg Ser 



Gin Thr His Ser 
5 

Met Arg Arg He 

Arg Phe Pro Gin 
40 

Ala He Ser Leu 
55 

Ser Thr Lys Asp 
70 

Phe Tyr He Glu 
85 

Met Gin Glu Val 

He Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Phe Tyr Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn His Ala 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg He Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 24 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi29 
<400> 24 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg He Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala He Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 



WO 2005/113592 



14/43 



PCT/US2005/017471 



50 55 60 



Ala 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 
85 


Tvr 


He 


Glu 


Leu 


Phe 
90 


Gin 


Gin 


Met 


Asn 


Asp 
95 


Leu 


Glu 


Ala 


Cys 


Val 
i no 


Met 


Gin 


Glu 


Val 


Gly 
105 


Val 


Glu 


Glu 


Thr 


Pro 
110 


Leu 


Met 


Asn 


Val 


Asp 
115 


Ser 


He 


Leu 


Ala 


Val 
120 


Arg 


Lys 


Tyr 


Phe 


Gin 
125 


Arg 


He 


Thr 


Leu 


Tyr 
130 


Leu 


Thr 


Glu 


Lys 


Lys 
135 


Tyr 


Ser 


Pro 


Cys 


Ala 
140 


Trp 


Glu 


Val 


Val 


Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 
165 


Glu 























<210> 25 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi30 



<400> 25 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


His 


Arg 


Arg 


Thr 


Met 


Met 


1 








5- 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Arg 


Arg 


lie 


Ser 


Leu 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Gly Asn 


His 


Ala 






35 










40 










45 








Gin 


Lys 


Val 


Gin 


Ala 


He 


Ala 


Leu 


Phe 


Tyr 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Tyr 


He 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Met 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Ala 


Met 








100 










105 










110 






Asn 


Val 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 26 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi31 
<400> 26 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 
15 10 15 
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Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ala Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 27 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi32 
<400> 27 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Leu 

35 40 45 

Gin Lys Val Pro Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 28 
<211> 166 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Synthetic construct 14epi33 
<400> 28 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Val Pro Ala lie Ser Leu Phe Tyr Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 . 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 29 

<211> 166 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 29 

Cys Asp Leu Pro 

1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Pro 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Glu Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Sequence 



construct 14epi 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Arg Phe Pro Gin 
40 

Ala lie Ser Leu 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



4 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Phe Tyr Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn His Phe 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val val 

Asn Leu Gin Glu 
160 



<210> 30 
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<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi35 



<400> 30 



C\r<=i 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Glv 


His 


Arg 


Arg 


Thr 


Met 


Met 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 






He 


Ser 


Leu 


Phe 


Ser 


Cvs 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Arg 


Phe 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Glv 


Asn 


His 


Leu 






35 










40 










45 








Gin 


Lys 


Ala 


Pro 


Ala 


He 


Ser 


Val 


Leu 


His 


Glu 


Met 


Met 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Tyr 


He 


Glu 


Leu 


Phe 


Gin 


Gin 


Met 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Met 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Lys 


Glu 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Arg 


Ser 


Lys 


Glu 























165 



<210> 31 
<211> 166 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 31 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Ala Glu 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Lys Glu Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu He 
145 



Sequence 



construct 14epi: 

Gin Thr His Ser 
5 

Met Arg Arg He 

Arg Phe Pro Gin 
40 

Thr He Pro Val 
55 

Ser Thr Lys Asp 
70 

Phe Tyr He Glu 
85 

Met Gin Glu Val 

He Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 
150 



6 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn His Phe 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg He Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 
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Ser Leu Arg Ser Lys Glu 
165 



<210> 32 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi37 
<400> 32 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Ala Gin Ala lie Ser Val Leu His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 33 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi38 
<400> 33 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Phe 

35 40 45 

Gin Lys Ala Pro Ala lie Ser Val Leu His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Lys Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 
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115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 34 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi39 



<400> 34 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Arg Phe Pro Gin 
40 

Ala lie Ser Leu 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly His Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Asp 

Phe Tyr Glu Met 

60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Met Met 
15 

Cys Leu Lys Asp 
30 

Gly Asn Gin Ala 
45 

Met Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 35 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic construct 14epi42 
<400> 35 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Val Phe His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 
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Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Phe Tyr lie Glu 
85 

Met Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Met Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 36 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epi08 
<400> 36 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

1 5 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Phe 

35 40 45 

Gin Lys Thr Gin Ala lie Ser Val Leu His Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asn Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 37 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epil8 
<400> 37 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 
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35 

Gin Lys Thr Gin Ala 
50 

Phe Asn Leu Phe Ser 
65 

Leu Leu Glu Lys Phe 
85 

Glu Ala Cys Val lie 
100 

Asn Val Asp Ser lie 
115 

Leu Tyr Leu Thr Glu 
130 

Arg Ala Glu lie Met 
145 

Ser Leu Arg Ser Lys 
165 



40 

lie Ser Val Leu His 
55 

Thr Lys Asp Ser Ser 
70 

Tyr lie Glu Leu Phe 
90 

Gin Glu Val Gly Val 
105 

Leu Ala Val Arg Lys 
120 

Lys Lys Tyr Ser Pro 
135 

Arg Ser Phe Ser Phe 

150 

Glu 



45 

Glu Leu lie Gin Gin Thr 
60 

Ala Ala Trp Asp Glu Thr 
75 80 
Gin Gin Met Asn Asn Leu 
95 

Glu Glu lie Ala Leu Met 
110 

Tyr Phe Arg Arg lie Thr 
125 

Cys Ala Trp Glu Val Val 
140 

Ser Thr Asn Leu Gin Glu 
155 160 



<210> 38 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epil9 
<400> 38 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Phe Leu Leu Tyr Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 . 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asn Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 39 

<211> 166 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epi20 

<400> 39 
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Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

1 5 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Leu Leu Tyr Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asn Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 40 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epi25 
<400> 40 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Val Leu His Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asn Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 41 

<211> 166 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Construct 25Epi26 
<400> 41 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Val Leu His Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 42 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epi27 
<400> 42 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

1 5 10 15 

Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 

35 40 45 

Gin Lys Val Gin Ala lie Ser Val Leu His Glu Leu lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Ala Gin Gin Met Asn Asn Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 
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<210> 43 
<211> 166 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25Epi28 



<400> 43 
Cys Asp Leu Pro 
1 

Leu Met Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Val Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Ser Leu Arg Ser 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Gly Phe Pro Glu 
40 

Ala lie Ser Val 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

lie Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Ser Asn Arg 
10 

Ser Pro Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Leu 
60 

Ser Ser Ala Ala 
75 

Leu Tyr Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Phe Ser Thr 
155 



Arg Thr Leu Met 
15 

Cys Leu Lys Asp 
30 

Gly His Gin Ala 
45 

lie Gin Gin Thr 

Trp Asp Glu Thr 
80 

Leu Asn Asp Leu 
95 

lie Ala Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Glu 
160 



<210> 44 
<211> 166 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 44 
Cys Asp Leu Pro 
1 

Leu Met Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Thr Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 



Sequence 



Construct 25Epi: 

Gin Thr His Ser 
5 

Met Arg Arg lie 

Gly Phe Pro Glu 
40 

Ala lie Ser Val 
55 

Ser Thr Lys Asp 
70 

Phe Tyr lie Glu 
85 

lie Gin Glu Val 

lie Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 



9 



Leu Ser Asn Arg 
10 

Ser Pro Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Leu 
60 

Ser Ser Ala Ala 
75 

Leu Phe Gin Gin 
90 

Gly Val Glu Glu 
105 

Arg Lys Tyr Phe 

Ser Pro Cys Ala 
140 



Arg Thr Leu Met 
15 

Cys Leu Lys Asp 
30 

Gly His Gin Phe 
45 

lie Gin Gin Thr 

Trp Asp Glu Thr 
80 

Met Asn Asn Leu 
95 

lie Ala Leu Met 
110 

Arg Arg lie Thr 
125 

Trp Glu Val Val 
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Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Asp 
145 150 155 160 

Ser Leu Arg Ser Lys Glu 
165 



<210> 45 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 45 

Cys Asp . Leu Pro Glu Thr His Ser Leu Asp Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Leu Ala Gin Met Ser Arg lie Ser Pro Ser Ser Cys Leu Met Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Phe 

35 40 45 

Gin Lys Ala Pro Ala lie Ser Val Leu His Glu Leu lie Gin Gin lie 

50 55 60 

Phe Asn Leu Phe Thr Thr. Lys Asp Ser Ser Ala Ala Trp Asp Glu Asp 
65 70 75 80 

Leu Leu Asp Lys Phe Cys Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Glu Arg Val Gly Glu Thr Pro Leu Met 

100 105 110 

Asn Ala Asp Ser lie Leu Ala Val Lys Lys Tyr Phe Arg Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Leu Ser Leu Ser Thr Asn Leu Gin Glu 
145 150 155 160 

Arg Leu Arg Arg Lys Glu 
165 



<210> 46 
<211> 165 
<212> PRT 

<213> homo sapiens 



<400> 46 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Lys Ala Glu Thr 
50 

Asn Leu Phe Ser 
65 

Leu Asp Lys Phe 

Ala Cys Val lie 
100 

Glu Asp Ser lie 
115 

Tyr Leu Lys Glu 
130 

Ala Glu lie Met 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Gly Phe Pro Gin 
40 

lie Pro Val Leu 
55 

Thr Lys Asp Ser 
70 

Tyr Thr Glu Leu 
85 

Gin Gly Val Gly 

Leu Ala Val Arg 
120 

Lys Lys Tyr Ser 
135 

Arg Ser Phe Ser 



Leu Gly Ser Arg 
10 

Ser Leu Phe Ser 
25 

Glu Glu Phe Gly 

His Glu Met lie 
60 

Ser Ala Ala Trp 
75 

Tyr Gin Gin Leu 
90 

Val Thr Glu Thr 
105 

Lys Tyr Phe Gin 

Pro Cys Ala Trp 
140 

Leu Ser Thr Asn 



Arg Thr Leu Met 
15 

Cys Leu Lys Asp 
30 

Asn Gin Phe Gin 
45 

Gin Gin lie Phe 

Asp Glu Thr Leu 
80 

Asn Asp Leu Glu 
95 

Pro Leu Met Lys 
110 

Arg lie Thr Leu 
125 

Glu Val Val Arg 
Leu Gin Glu Ser 
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145 150 
Leu Arg Ser Lys Glu 
165 



155 



160 



<210> 47 
<211> 165 
<212> PRT 

<213> homo sapiens 
<400> 47 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly Ser Arg Arg Thr Leu Met 

1 5 10 15 

Leu Leu Ala Gin Met Arg Lys lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Gly Asn Gin Phe Gin 

35 40 45 

Lys Ala Glu Thr lie Pro Val Leu His Glu Met lie Gin Gin lie Phe 

50 55 60 

Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr Leu 
65 70 75 80 

Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu Glu 

85 90 95 

Ala Cys Val lie Gin Gly Val Gly Val Thr Glu Thr Pro Leu Met Lys 

100 105 110 

Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr Leu 

115 120 125 

Tyr Leu Lys Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val Arg 

130 135 140 

Ala Glu lie Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gin Glu Ser 
145 150 155 160 

Leu Arg Ser Lys Glu 
165 



<210> 48 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 48 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly Asn Arg Arg Ala Leu lie 

1 5 10 15 

Leu Leu Ala Gin Met Gly Arg lie Ser His Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Phe 

35 40 45 

Gin Lys Thr Gin Ala lie Ser Val Leu His Glu Met lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gin Ser 
65 70 75 80 

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Leu Ser Phe Ser Thr Asn Leu Gin Lys 
145 150 155 160 
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Arg Leu Arg Arg Lys Asp 
165 



<210> 49 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 49 

Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 

1 5 10 15 

lie Met Ala Gin Met Gly Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Phe 

35 40 45 

Gin Lys Ala Gin Ala lie Ser Val Leu His Glu Met lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Met Met Gin Glu Val Gly Val Glu Asp Thr Pro Leu Met 

100 105 110 

Asn Val Asp Ser lie Leu Thr Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Leu Ser Ala Asn Leu Gin Glu 
145 150 155 160 

Arg Leu Arg Arg Lys Glu 
165 



<210> 50 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 50 

Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 

15 10 15 

Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Phe 

35 40 45 

Gin Lys Ala Glu Ala lie Ser Val Leu His Glu Val lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Val Ala Trp Asp Glu Arg 
65 70 75 80 

Leu Leu Asp Lys Leu Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Val Met Gin Glu Val Trp Val Gly Gly Thr Pro Leu Met 

100 105 110 

Asn Glu Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Ser Ser Arg Asn Leu Gin Glu 
145 150 155 160 

Arg Leu Arg Arg Lys Glu 
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165 



<210> 51 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 51 

Cys Asp Leu Pro Gin Thr His Ser Leu Arg Asn Arg Arg Ala Leu lie 

15 10 15 

Leu Leu Ala Gin Met Gly Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Glu Phe Arg Phe Pro Glu Glu Glu Phe Asp Gly His Gin Phe 

35 40 45 

Gin Lys Thr Gin Ala lie Ser Val Leu His Glu Met lie Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gin Ser 
65 70 75 80 

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu 

85 90 95 

Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Glu Asp Phe lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 

115 120 125 

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

.130 135 140 

Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Lys Lys 
145 150 155 160 

Gly Leu Arg Arg Lys Asp 
165 



<210> 52 
<211> 166 
<212> PRT 

<213> homo sapiens 



<400> 52 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Ala Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Asp Glu 

Glu Ser Cys Val 
100 

Tyr Glu Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Arg Leu Lys Ser 



Gin Thr His Ser 
5 

Met Arg Arg lie 

Glu Phe Pro Gin 
40 

Ala He Ser Val 
55 

Ser Thr Lys Asp 

70 

Phe Tyr He Glu 
85 

Met Gin Glu Val 

He Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly Asn Arg 
10 

Ser Pro Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Met 
60 

Ser Ser Ala Ala 
75 

Leu Asp Gin Gin 
90 

Gly Val He Glu 
105 

Arg Lys Tyr Phe 

Ser Ser Cys Ala 
140 

Ser Leu Ser He 
155 



Arg Ala Leu He 
15 

Cys Leu Lys Asp 
30 

Asp Lys Gin Phe 
45 

He Gin Gin Thr 

Leu Asp Glu Thr 
80 

Leu Asn Asp Leu 
95 

Ser Pro Leu Met 
110 

Gin Arg He Thr 
125 

Trp Glu Val Val 

Asn Leu Gin Lys 
160 
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<210> 53 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 53 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


Asn 


Arg 


Arg 


Ala 


Leu 


He 


1 








5 










10 










15 




Leu 


Leu 


Gly 


Gin 


Met 


Gly Arg 


He 


Ser 


Pro 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Ara 
rtj. y 


His 


Asp 


Phe 


Arg 


He 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Glv 


Asn 


Gin 


Phe 






35 










40 










45 








Gin 


Lys 


Ala 


Gin 


Ala 


He 


Ser 


Val 


Leu 


His 


Glu 


Met 


He 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Glu Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Glu 


Gin 


Ser 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Ser 


Thr 


Glu 


Leu 


Tyr 


Gin 


Gin 


Leu 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


He 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Glu 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


He 


Glu 


Arg 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Leu 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Lys 


145 










150 










155 










160 


Arg 


Leu 


Arg 


Arg 


Lys 


Asp 























165 



<210> 54 
<211> 166 
<212> PRT 

<213> homo sapiens 
<400> 54 

Cys Asn Leu Ser Gin Thr His Ser Leu Asn Asn Arg Arg Thr Leu Met 

15 10 15 

Leu Met Ala Gin Met Arg Arg He Ser Pro Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Glu Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin Phe 

35 40 45 

Gin Lys Ala Gin Ala He Ser Val Leu His Glu Met Met Gin Gin Thr 

50 55 60 

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr 
65 70 75 80 

Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 

85 90 95 

Glu Ala Cys Val He Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 

100 105 110 

Asn Glu Asp Ser He Leu Ala Val Arg Lys Tyr Phe Gin Arg He Thr 

115 120 125 

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 

130 135 140 

Arg Ala Glu He Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Lys 
145 150 155 160 

Arg Leu Arg Arg Lys Asp 
165 
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<210> 55 

<211> 166 

<212> PRT 

<213> homo sapiens 



<400> 55 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


Asn 


Arg 


Arg 


Ala 


Leu 


He 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Gly 


Arg 


He 


Ser 


His 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


Tyr 


Asp 


Phe 


Gly 


Phe 


Pro 


Gin 


Glu 


Val 


Phe 


Asp 


Gly 


Asn 


Gin 


Phe 






35 










40 










45 








Gin 


Lys 


Ala 


Gin 


Ala 


He 


Ser 


Ala 


Phe 


His 


Glu 


Met 


He 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Lys 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Asp 


Glu 


Thr 


65 










70 










75 










80 


Leu 


Leu 


Asp 


Lys 


Phe 


Tyr 


He 


Glu 


Leu 


Phe 


Gin 


Gin 


Leu 


Asn 


Asp 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


Thr 


Gin 


Glu 


Val 


Gly 


Val 


Glu 


Glu 


He 


Ala 


Leu 


Met 








100 










105 










110 






Asn 


Glu 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Met 


Gly 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Phe 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Lys 


145 










150 










155 










160 


Gly 


Leu 


Arg 


Arg 


Lys 


Asp 























165 



<210> 56 

<211> 166 

<212> PRT 

<213> homo sapiens 



<400> 56 



Cys 


Asp 


Leu 


Pro 


Gin 


Thr 


His 


Ser 


Leu 


Gly 


Asn 


Arg 


Arg 


Ala 


Leu 


He 


1 








5 










10 










15 




Leu 


Leu 


Ala 


Gin 


Met 


Gly Arg 


He 


Ser 


Pro 


Phe 


Ser 


Cys 


Leu 


Lys 


Asp 








20 










25 










30 






Arg 


His 


Asp 


Phe 


Gly 


Leu 


Pro 


Gin 


Glu 


Glu 


Phe 


Asp 


Gly 


Asn 


Gin 


Phe 






35 










40 










45 








Gin 


Lys 


Thr 


Gin 


Ala 


He 


Ser 


Val 


Leu 


His 


Glu 


Met 


He 


Gin 


Gin 


Thr 




50 










55 










60 










Phe 


Asn 


Leu 


Phe 


Ser 


Thr 


Glu 


Asp 


Ser 


Ser 


Ala 


Ala 


Trp 


Glu 


Gin 


Ser 


65 










70 










75 










80 


Leu 


Leu 


Glu 


Lys 


Phe 


Ser 


Thr 


Glu 


Leu 


Tyr 


Gin 


Gin 


Leu 


Asn 


Asn 


Leu 










85 










90 










95 




Glu 


Ala 


Cys 


Val 


He 


Gin 


Glu 


Val 


Gly 


Met 


Glu 


Glu 


Thr 


Pro 


Leu 


Met 








100 










105 










110 






Asn 


Glu 


Asp 


Ser 


He 


Leu 


Ala 


Val 


Arg 


Lys 


Tyr 


Phe 


Gin 


Arg 


He 


Thr 






115 










120 










125 








Leu 


Tyr 


Leu 


Thr 


Glu 


Lys 


Lys 


Tyr 


Ser 


Pro 


Cys 


Ala 


Trp 


Glu 


Val 


Val 




130 










135 










140 










Arg 


Ala 


Glu 


He 


Met 


Arg 


Ser 


Leu 


Ser 


Phe 


Ser 


Thr 


Asn 


Leu 


Gin 


Lys 


145 










150 










155 










160 


He 


Leu 


Arg 


Arg 


Lys 


Asp 























165 
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<210> 57 
<211> 166 
<212> PRT 

<213> homo sapiens 



<400> 57 
Cys Asp Leu Pro 
1 

Leu Leu Ala Gin 
20 

Arg His Asp Phe 
35 

Gin Lys Ala Gin 
50 

Phe Asn Leu Phe 
65 

Leu Leu Glu Lys 

Glu Ala Cys Val 
100 

Asn Val Asp Ser 
115 

Leu Tyr Leu Thr 
130 

Arg Ala Glu lie 
145 

Arg Leu Arg Arg 



Gin Thr His Ser 
5 

Met Gly Arg lie 

Gly Phe Pro Gin 
40 

Ala lie Ser Val 
55 

Ser Thr Lys Asp 
70 

Phe Ser Thr Glu 
85 

lie Gin Glu Val 

He Leu Ala Val 
120 

Glu Lys Lys Tyr 
135 

Met Arg Ser Phe 

150 
Lys Glu 
165 



Leu Gly Asn Arg 
10 

Ser Pro Phe Ser 
25 

Glu Glu Phe Asp 

Leu His Glu Met 
60 

Ser Ser Ala Thr 
75 

Leu Asn Gin Gin 
90 

Gly Val Glu Glu 
105 

Lys Lys Tyr Phe 

Ser Pro Cys Ala 
140 

Ser Leu Ser Lys 
155 



Arg Ala Leu He 
15 

Cys Leu Lys Asp 
30 

Gly Asn Gin Phe 
45 

He Gin Gin Thr 

Trp Glu Gin Ser 
80 

Leu Asn Asp Leu 
95 

Thr Pro Leu Met 
110 

Gin Arg lie Thr 
125 

Trp Glu Val Val 

He Phe Gin Glu 
160 



<210> 58 

<211> 167 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct IFN-alpha Conl 
<400> 58 

Met Cys Asp Leu Pro Gin Thr His Ser Leu Gly Asn Arg Arg Ala Leu 

15 10 15 

He Leu Leu Ala Gin Met Arg Arg He Ser Pro Phe Ser Cys Leu Lys 

20 25 30 

Asp Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Asp Gly Asn Gin 

35 40 45 

Phe Gin Lys Ala Gin Ala He Ser Val Leu His Glu Met He Gin Gin 

50 55 60 

Thr Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu 
65 70 75 80 

Ser Leu Leu Glu Lys Phe Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp 

85 90 95 

Leu Glu Ala Cys Val He Gin Glu Val Gly Val Glu Glu Thr Pro Leu 

100 105 110 

Met Asn Val Asp Ser He Leu Ala Val Lys Lys Tyr Phe Gin Arg He 

115 120 125 

Thr Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val 

130 135 140 

Val Arg Ala Glu He Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gin 
145 150 155 160 

Glu Arg Leu Arg Arg Lys Glu 
165 
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<210> 59 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil5cl 



<400> 59 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atgcagcaga 

ctcctagaaa 

atgcaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctctttt 
atggcaacca 
ccttcaacct 
aattctacat 
ttggagtgga 
ttcaaagaat 
tcagagcaga 
gcaaggaa 



cagcctgggt 
ctcctgtctg 
cctccagaag 
cttcagcaca 
tgaacttttc 
agagactccc 
cactctttat 
aatcatgaga 



cacaggagga 
aaggacagac 
gttcaagcta 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ccatgatgct 
atgacttcag 
tcttcctttt 
ctgctgcttg 
atgacctgga 
tggactccat 
agaagtatag 
tttcaacaaa 



cctggcacaa 60 
atttccccag 120 
ctatgagatg 180 
ggatgagacc 240 
agcctgcgtg 300 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 60 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil5c2 



<400> 60 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 



ggtcaccgtc 
ctgaaggacc 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ctatcttcct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 61 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil5c3 



<400> 61 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ctatcttcct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 300 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 62 
<211> 498 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil8cl 



<400> 62 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atgcagcaga 

ctcctagaaa 

atgcaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctctttt 
atggcaacca 
ccttcaacct 
aattctacat 
ttggagtgga 
ttcaaagaat 
tcagagcaga 
gcaaggaa 



cagcctgggt 
ctcctgtctg 
cgcccagaag 
cttcagcaca 
tgaacttttc 
agagactccc 
cactctttat 
aatcatgaga 



cacaggagga 
aaggacagac 
gttcaagcta 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ccatgatgct 
atgacttcag 
tcttcctttt 
ctgctgcttg 
atgacctgga 
tggactccat 
agaagtatag 
tttcaacaaa 



cctggcacaa 
atttccccag 
ctatgagatg 
ggatgagacc 
agcctgcgtg 
cctggctgtg 
cccttgtgcc 
cttgcaagaa 



60 

120 

180 

240 

300 

360 

420 

480 

498 



<210> 63 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil8c2 



<400> 63 

agttgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggacc 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ctatcttcct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 30 0 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 64 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil8c3 



<400> 64 

agttgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ctatcttcct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
tttctatgag 180 
ttgggatgag 240 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 65 

<211> 498 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Synthetic Construct 14epi20cl 



<400> 65 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atgcagcaga 

ctcctagaaa 

atgcaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctctttt 
atggcaacca 
ccttcaacct 
aattctacat 
ttggagtgga 
ttcaaagaat 
tcagagcaga 
gcaaggaa 



cagcctgggt 
ctcctgtctg 
cgcccagaag 
cttcagcaca 
tgaacttttc 
agagactccc 
cactctttat 
aatcatgaga 



cacaggagga 
aaggacagac 
gttcaagcta 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ccatgatgct 
atgacttcag 
tctctctttt 
ctgctgcttg 
atgacctgga 
tggactccat 
agaagtatag 
tttcaacaaa 



cctggcacaa 60 
atttccccag 120 
ctatgagatg 180 
ggatgagacc 24 0 
agcctgcgtg 3 00 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 66 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi20c2 



<400> 66 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggacc 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
tttctatgag 180 
ttgggatgag 240 
ggaagcctgc 300 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 67 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi20c3 



<400> 67 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 68 

<211> 498 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 14epi28cl 



<400> 68 

tgtgatctgc ctcagaccca cagcctgggt cacaggagga ccatgatgct cctggcacaa 60 
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atgaggagaa 
gaggagtttg 
atgcagcaga 
ctcctagaaa 
atgcaggagg 
aggaaatact 
tgggaggttg 
agcttacgca 



tctctctttt 
atggcaacca 
ccgctaacct 
aattctacat 
ttggagtgga 
ttcaaagaat 
tcagagcaga 
gcaaggaa 



ctcctgtctg 
cgcccagaag 
cttcagcaca 
tgaacttttc 
agagactccc 
cactctttat 
aatcatgaga 



aaggacagac 
gttcaagcta 
aaggactcat 
cagcaaatga 
ctgatgaagg 
ctgacagaga 
tctttctctt 



atgacttcag 
tctctctttt 
ctgctgcttg 
atgacctgga 
aggactccat 
agaagtatag 
tttcaacaaa 



atttccccag 120 
ctatgagatg 180 
ggatgagacc 24 0 
agcctgcgtg 3 00 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 69 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi28c2 



<400> 69 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccgctaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 



ggtcaccgtc 
ctgaaggacc 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 70 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi28c3 



<400> 70 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccgctaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacgcccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 



ggtcaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 71 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi32cl 



<400> 71 

tgtgatctgc ctcagaccca cagcctgggt 
atgaggagaa tctctctttt ctcctgtctg 
gaggagtttg atggcaacca cctccagaag 
atgcagcaga ccttcaacct cttcagcaca 
ctcctagaaa aattctacat tgaacttttc 



cacaggagga ccatgatgct cctggcacaa 60 
aaggacagac atgacttcag atttccccag 120 
gttccagcta tctctctttt ctatgagatg 180 
aaggactcat ctgctgcttg ggatgagacc 240 
cagcaaatga atgacctgga agcctgcgtg 3 00 
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atgcaggagg ttggagtgga agagactccc ctgatgaagg aggactccat cctggctgtg 3 60 
aggaaatact ttcaaagaat cactctttat ctgacagaga agaagtatag cccttgtgcc 420 
t 999 a 99 tt 9 tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaagaa 480 
agcttacgca gcaaggaa 498 

<210> 72 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi32c2 



<400> 72 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 



ggtcaccgtc 
ctgaaggacc 
aaggttccag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
tttctatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 73 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi32c3 



<400> 73 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggaca 
aaggttccag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ctatctctct 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
tttctatgag 180 
ttgggatgag 240 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 74 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi35cl 



<400> 74 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atgcagcaga 

ctcctagaaa 

atgcaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctctttt 
atggcaacca 
ccttcaacct 
aattctacat 
ttggagtgga 
ttcaaagaat 
tcagagcaga 
gcaaggaa 



cagcctgggt 
ctcctgtctg 
cctccagaag 
cttcagcaca 
tgaacttttc 
agagactccc 
cactctttat 
aatcatgaga 



cacaggagga 
aaggacagac 
gctccagcca 
aaggactcat 
cagcaaatga 
ctgatgaagg 
ctgacagaga 
tctttctctt 



ccatgatgct 
atgacttcag 
tctctgtcct 
ctgctgcttg 
atgacctgga 
aggactccat 
agaagtatag 
tttcaacaaa 



cctggcacaa 60 
atttccccag 120 
ccatgagatg 180 
ggatgagacc 24 0 
agcctgcgtg 300 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 
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<210> 75 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi35c2 



<400> 75 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaacg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
catcactctt 
agaaatcatg 
a 



ggtcaccgtc 
ctgaaggacc 
aaggctccag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
cgctctttct 



gcaccatgat 
gccatgactt 
ccatctctgt 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
ccgctttccc 120 
cctccatgag 180 
ttgggatgag 24 0 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 76 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epi35c3 



<400> 76 

atgtgtgatc 

caaatgcgtc 

caggaggagt 

atgatgcagc 

accctcctag 

gtgatgcagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctct 
ttgatggcaa 
agaccttcaa 
aaaaattcta 
aggttggagt 
actttcaaag 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgt 
ccacctccag 
cctcttcagc 
cattgaactt 
ggaagagact 
aatcactctt 
agaaatcatg 



ggtcaccgtc 
ctgaaggaca 
aaggctccag 
acaaaggact 
ttccagcaaa 
cccctgatga 
tatctgacag 
agatctttct 



gcaccatgat 
gacatgactt 
ccatctctgt 
catctgctgc 
tgaatgacct 
aggaggactc 
agaagaagta 
ctttttcaac 



gctcctggca 60 
cagatttccc 120 
cctccatgag 180 
ttgggatgag 240 
ggaagcctgc 3 00 
catcctggct 360 
tagcccttgt 420 
aaacttgcaa 480 
501 



<210> 77 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil8cl 



<400> 77 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atccagcaga 

ctcctagaaa 

atacaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctccttt 
atggccacca 
ccttcaatct 
aattctacat 
ttggggtgga 
tccgtcgcat 
tcagagcaga 
gcaaggaa 



cagcctgagt 
ctcctgcctg 
ggcccagaag 
cttcagcaca 
tgaacttttc 
agagattgcc 
cactctctat 
aatcatgaga 



aacaggagga 
aaggacagac 
actcaagcca 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ctctgatgct 
atgatttcgg 
tctccgttct 
ctgctgcttg 
ataacctgga 
tggactccat 
agaaatacag 
tttcaacaaa 



catggcacaa 60 
attccccgag 120 
ccatgagctg 180 
ggatgagacc 240 
agcatgtgtg 300 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 78 
<211> 501 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil8c2 



<400> 78 

atgtgtgatc 

caaatgcgtc 

gaggaggagt 

ctgatccagc 

accctcctag 

gtgatacagg 

gtgcgtaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctcc 
ttgatggcca 
agaccttcaa 
aaaaattcta 
aggttggggt 
acttccgtcg 
ttgtccgcgc 
gcagcaagga 



ccacagcctg 
tttctcctgc 
ccaggcccag 
tctcttcagc 
cattgaactt 
ggaagagatt 
catcactctc 
agaaatcatg 



agtaaccgtc 
ctgaaggacc 
aagactcaag 
acaaaggact 
ttccagcaaa 
gccctgatga 
tatctgacag 
cgctctttct 



gcactctgat 
gccatgattt 
ccatctccgt 
catctgctgc 
tgaataacct 
atgtggactc 
agaagaaata 
ctttttcaac 



gctcatggca 60 
cggattcccc 120 
tctccatgag 180 
ttgggatgag 24 0 
ggaagcatgt 300 
catcctggct 360 
cagcccttgt 420 
aaacttgcaa 480 
501 



<210> 79 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil8c3 



<400> 79 

atgtgtgatc 

caaatgcgtc 

gaggaggagt 

ctgatccagc 

accctcctag 

gtgatacagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctcc 
ttgatggcca 
agaccttcaa 
aaaaattcta 
aggttggggt 
acttccgtcg 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgc 
ccaggcccag 
tctcttcagc 
cattgaactt 
ggaagagatt 
catcactctc 
agaaatcatg 



agtaaccgtc 
ctgaaggaca 
aagactcaag 
acaaaggact 
ttccagcaaa 
gccctgatga 
tatctgacag 
agatctttct 



gcactctgat 
gacatgattt 
ccatctccgt 
catctgctgc 
tgaataacct 
atgtggactc 
agaagaaata 
ctttttcaac 



gctcatggca 60 
cggattcccc 120 
tctccatgag 180 
ttgggatgag 24 0 
ggaagcatgt 3 00 
catcctggct 360 
cagcccttgt 420 
aaacttgcaa 48 0 
501 



<210> 80 

<211> 498 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil9cl 



<400> 80 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atccagcaga 

ctcctagaaa 

atacaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctccttt 
atggccacca 
ccttcaatct 
aattctacat 
ttggggtgga 
tccgtcgcat 
tcagagcaga 
gcaaggaa 



cagcctgagt 
ctcctgcctg 
ggcccagaag 
cttcagcaca 
tgaacttttc 
agagattgcc 
cactctctat 
aatcatgaga 



aacaggagga 
aaggacagac 
gttcaagcca 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ctctgatgct 
atgatttcgg 
tcttccttct 
ctgctgcttg 
ataacctgga 
tggactccat 
agaaatacag 
tttcaacaaa 



catggcacaa 60 
attccccgag 12 0 
ctatgagctg 180 
ggatgagacc 240 
agcatgtgtg 3 00 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 81 

<211> 501 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 25epil9c2 
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<400> 81 

atgtgcgacc 

cagatgcgtc 

gaagaggaat 

ctcatccagc 

accctgctgg 

gttatccagg 

gttcgtaaat 

gcttgggaag 

gaatccctgc 



tgccgcagac 
gcatttcccc 
ttgacggtca 
aaaccttcaa 
agaaatttta 
aagttggtgt 
acttccgtcg 
ttgtacgtgc 
gttccaaaga 



ccactccctg 
gttctcctgc 
ccaggctcag 
cctgttctcc 
catcgaactg 
tgaagagatc 
catcaccctg 
tgaaatcatg 



tccaaccgtc 
cttaaggacc 
aaagttcagg 
accaaagact 
ttccagcaaa 
gctctgatga 
tacctgaccg 
agatctttct 



gcaccctgat 
gtcacgactt 
ctatcttcct 
cttccgcggc 
tgaacaatct 
acgttgactc 
aaaaaaagta 
ccttctccac 



gctgatggct 60 
cggtttcccg 120 
gctgtacgag 180 
ttgggacgaa 24 0 
agaagcatgt 3 00 
catcctggct 360 
ctccccgtgc 420 
caacctgcag 480 
501 



<210> 82 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil9c3 



<400> 82 

atgtgtgatc 

caaatgcgtc 

gaggaggagt 

ctgatccagc 

accctcctag 

gtgatacagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctcc 
ttgatggcca 
agaccttcaa 
aaaaattcta 
aggttggggt 
acttccgtcg 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgc 
ccaggcccag 
tctcttcagc 
cattgaactt 
ggaagagatt 
catcactctc 
agaaatcatg 
a 



agtaaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
ttccagcaaa 
gccctgatga 
tatctgacag 
agatctttct 



gcactctgat 
gacatgattt 
ccatcttcct 
catctgctgc 
tgaataacct 
atgtggactc 
agaagaaata 
ctttttcaac 



gctcatggca 60 
cggattcccc 120 
tctctatgag 180 
ttgggatgag 24 0 
ggaagcatgt 3 00 
catcctggct 360 
cagcccttgt 420 
aaacttgcaa 480 
501 



<210> 83 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epi26cl 



<400> 83 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atccagcaga 

ctcctagaaa 

atacaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctccttt 
atggccacca 
ccttcaatct 
aattctacat 
ttggggtgga 
tccgtcgcat 
tcagagcaga 
gcaaggaa 



cagcctgagt 
ctcctgcctg 
ggcccagaag 
cttcagcaca 
tgaactttac 
agagattgcc 
cactctctat 
aatcatgaga 



aacaggagga 
aaggacagac 
gttcaagcca 
aaggactcat 
cagcaactga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ctctgatgct 
atgatttcgg 
tctccgttct 
ctgctgcttg 
atgacctgga 
tggactccat 
agaaatacag 
tttcaacaaa 



catggcacaa 6 0 
attccccgag 120 
ccatgagctg 180 
ggatgagacc 24 0 
agcatgtgtg 30 0 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagaa 480 
498 



<210> 84 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epi26c2 



<400> 84 

atgtgtgatc tgcctcagac ccacagcctg agtaaccgtc gcactctgat gctcatggca 60 
caaatgcgtc gcatctctcc tttctcctgc ctgaaggacc gccatgattt cggattcccc 120 
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gaggaggagt ttgatggcca ccaggcccag 
ctgatccagc agaccttcaa tctcttcagc 
accctcctag aaaaattcta cattgaactt 
gtgatacagg aggttggggt ggaagagatt 
gtgcgtaaat acttccgtcg catcactctc 
gcctgggagg ttgtccgcgc agaaatcatg 
gaaagcttac gcagcaagga a 



aaggttcaag ccatctccgt tctccatgag 180 
acaaaggact catctgctgc ttgggatgag 24 0 
taccagcaac tgaatgacct ggaagcatgt 300 
gccctgatga atgtggactc catcctggct 360 
tatctgacag agaagaaata cagcccttgt 420 
cgctctttct ctttttcaac aaacttgcaa 480 

501 



<210> 85 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epi26c3 



<400> 85 

atgtgtgatc 

caaatgcgtc 

gaggaggagt 

ctgatccagc 

accctcctag 

gtgatacagg 

gtgaggaaat 

gcctgggagg 

gaaagcttac 



tgcctcagac 
gcatctctcc 
ttgatggcca 
agaccttcaa 
aaaaattcta 
aggttggggt 
acttccgtcg 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tttctcctgc 
ccaggcccag 
tctcttcagc 
cattgaactt 
ggaagagatt 
catcactctc 
agaaatcatg 



agtaaccgtc 
ctgaaggaca 
aaggttcaag 
acaaaggact 
taccagcaac 
gccctgatga 
tatctgacag 
agatctttct 



gcactctgat 
gacatgattt 
ccatctccgt 
catctgctgc 
tgaatgacct 
atgtggactc 
agaagaaata 
ctttttcaac 



gctcatggca 60 
cggattcccc 120 
tctccatgag 180 
ttgggatgag 24 0 
ggaagcatgt 3 00 
catcctggct 360 
cagcccttgt 420 
aaacttgcaa 480 
501 



<210> 86 
<211> 498 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 25epi29cl 



<400> 86 

tgtgatctgc 

atgaggagaa 

gaggagtttg 

atccagcaga 

ctcctagaaa 

atacaggagg 

aggaaatact 

tgggaggttg 

agcttacgca 



ctcagaccca 
tctctccttt 
atggccacca 
ccttcaatct 
aattctacat 
ttggggtgga 
tccgtcgcat 
tcagagcaga 
gcaaggaa 



cagcctgagt 
ctcctgcctg 
gttccagaag 
cttcagcaca 
tgaacttttc 
agagattgcc 
cactctctat 
aatcatgaga 



aacaggagga 
aaggacagac 
actcaagcca 
aaggactcat 
cagcaaatga 
ctgatgaatg 
ctgacagaga 
tctttctctt 



ctctgatgct 
atgatttcgg 
tctccgttct 
ctgctgcttg 
ataacctgga 
tggactccat 
agaaatacag 
tttcaacaaa 



catggcacaa 60 
attccccgag 120 
ccatgagctg 180 
ggatgagacc 240 
agcatgtgtg 3 00 
cctggctgtg 360 
cccttgtgcc 420 
cttgcaagat 480 
498 



<210> 87 

<211> 501 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic Construct 25epi29c2 



<400> 87 

atgtgcgacc tgccgcagac ccactccctg 
cagatgcgca gaatctcccc gttctcctgc 
gaagaagaat ttgacggtca ccagttccag 
ctgatccagc agaccttcaa cctgttctcc 
accctgctgg aaaaattcta catcgaactg 
gttatccagg aagtaggtgt tgaggaaatc 



tccaaccgtc gtaccctgat gctgatggct 60 
cttaaggacc gtcacgactt cggtttccca 120 
aaaacccagg ctatctccgt tcttcatgaa 180 
accaaagact cctccgcggc ttgggacgaa 240 
tttcagcaga tgaacaacct agaggcatgt 300 
gctctgatga acgttgactc catcctggct 360 
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gtccggaaat acttccgtcg catcaccctg tacctgaccg aaaaaaaata ctccccgtgc 420 
gcttgggaag ttgtgcgcgc tgaaatcatg cgttccttct ccttctccac caacctgcag 480 
gactccctgc gttccaaaga a 501 

<210> 88 
<211> 501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epi29c3 



<400> 88 

atgtgtgatc 

caaatgcgtc 

gaggaggagt 

ctgatccagc 

accctcctag 

gtgatacagg 

gtgaggaaat 

gcctgggagg 

gatagcttac 



tgcctcagac 
gcatctctcc 
ttgatggcca 
agaccttcaa 
aaaaattcta 
aggttggggt 
acttccgtcg 
ttgtcagagc 
gcagcaagga 



ccacagcctg 
tt/tctcctgc 
ccagttccag 
tctcttcagc 
cattgaactt 
ggaagagatt 
catcactctc 
agaaatcatg 
a 



agtaaccgtc 
ctgaaggaca 
aagactcaag 
acaaaggact 
ttccagcaaa 
gccctgatga 
tatctgacag 
agatctttct 



gcactctgat 
gacatgattt 
ccatctccgt 
catctgctgc 
tgaataacct 
atgtggactc 
agaagaaata 
ctttttcaac 



gctcatggca 60 
cggattcccc 120 
tctccatgag 180 
ttgggatgag 24 0 
ggaagcatgt 3 00 
catcctggct 360 
cagcccttgt 420 
aaacttgcaa 480 
501 



<210> 89 
<211> 579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 14epil8m2 



<220> 
<221> CDS 
<222> (57) 



(560) 



<400> 89 

caattgtgag cggataacaa tttcacacag aattcattaa agaggagaaa ttaact atg 59 

Met 
1 



tgc gac ctg ccg cag acc cac tec ctt gga cac cgt egg acc atg atg 
Cys Asp Leu Pro Gin Thr His Ser Leu Gly His Arg Arg Thr Met Met 
5 10 15 



107 



ctg eta get cag atg cgt aga ata tec ctg ttc tec tgc ctt aag gac 
Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 
20 25 30 



155 



cgt cac gac ttc cgt ttc ccc cag gaa gaa ttt gat ggt aac cac get 
Arg His Asp Phe Arg Phe Pro Gin Glu Glu Phe Asp Gly Asn His Ala 
35 40 45 



203 



cag aaa gtt cag get ate ttc ctg ttt tac gaa atg atg cag caa acc 
Gin Lys Val Gin Ala lie Phe Leu Phe Tyr Glu Met Met Gin Gin Thr 
50 55 60 65 



251 



ttc aac ctg ttc tec acc aaa gac tec tec gcg get tgg gac gaa acc 
Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
70 75 80 



299 



ctg ctg gaa aaa ttc tac ate gaa ctg ttc cag cag atg aac gat eta 347 
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Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asp Leu 
85 90 95 

gaa gca tgc gtt atg cag gaa gta ggt gtt gaa gag acc cca ctg atg 395 
Glu Ala Cys Val Met Gin Glu Val Gly Val Glu Glu Thr Pro Leu Met 
100 105 110 

aac gtt gac tec ate ctg get gtc egg aaa tac ttc cag cgt ate acc 443 
Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Gin Arg lie Thr 
115 120 125 

ctg tac ctg act gaa aaa aaa tac tec ccg tgc get tgg gaa gtt gtg 491 
Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 
130 135 140 145 

cgc get gaa ate atg cgt tec ttc tec ttc tec acc aac ctg cag gaa 53 9 
Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
150 155 160 

tec ctg cgt tec aaa gaa taa taagcttaat tagctgagc 579 
Ser Leu Arg Ser Lys Glu * 
165 



<210> 90 
<211> 579 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 25epil9m 

<220> 

<221> CDS 

<222> (57) . . . (560) 

<400> 90 

caattgtgag eggataacaa tttcacacag aattcattaa agaggagaaa ttaact atg 59 

Met 
1 

tgc gac ctg ccg cag acc cac tec ctt tct aac cgt aga acc ctg atg 107 
Cys Asp Leu Pro Gin Thr His Ser Leu Ser Asn Arg Arg Thr Leu Met 
5 10 15 

ctg atg get cag atg cgt cgt att tct ccg ttc tct tgc ctt aag gat 155 
Leu Met Ala Gin Met Arg Arg lie Ser Pro Phe Ser Cys Leu Lys Asp 
20 25 30 

cgt cac gac ttc ggt ttc cca gaa gag gaa ttt gac ggt cac cag get 203 
Arg His Asp Phe Gly Phe Pro Glu Glu Glu Phe Asp Gly His Gin Ala 
35 40 45 

cag aaa gtt cag get ate ttc ctg ctg tac gag etc ate cag caa acc 251 
Gin Lys Val Gin Ala lie Phe Leu Leu Tyr Glu Leu lie Gin Gin Thr 
50 55 60 65 



ttc aac ctg ttc tct acc aaa gac tct tec gcg get tgg gac gaa acc 
Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr 
70 75 80 



299 
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ctg ctg gag aaa ttt tac ate gaa ctg ttc cag caa atg aac aat eta 347 
Leu Leu Glu Lys Phe Tyr lie Glu Leu Phe Gin Gin Met Asn Asn Leu 
85 90 95 

gaa gca tgt gtt ate cag gaa gtt ggt gtt gaa gag ate get ctg atg 3 95 
Glu Ala Cys Val lie Gin Glu Val Gly Val Glu Glu lie Ala Leu Met 
100 105 110 

aac gtt gac tec ate ctg get gtt cgt aaa tac ttc cgt cgc ate ace 443 
Asn Val Asp Ser lie Leu Ala Val Arg Lys Tyr Phe Arg Arg lie Thr 
115 . 120 125 

ctg tac ctg acc gaa aaa aag tac tec ccg tgc get tgg gaa gtt gta 491 
Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val 
130 135 140 145 

cgt get gaa ate atg aga tct ttc tec ttc tec acc aac ctg cag gaa 539 
Arg Ala Glu lie Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gin Glu 
150 155 160 

tec ctg cgt tec aaa gaa taa taagcttaat tagctgagc 579 
Ser Leu Arg Ser Lys Glu * 
165 



